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Abstract 



We develop an algorithmic theory of convex optimization over discrete sets. Using 
a combination of algebraic and geometric tools we are able to provide polynomial time 
algorithms for solving broad classes of convex combinatorial optimization problems and 
convex integer programming problems in variable dimension. We discuss some of the many 
applications of this theory including to quadratic programming, matroids, bin packing 
and cutting-stock problems, vector partitioning and clustering, multiway transportation 
problems, and privacy and confidential statistical data disclosure. Highlights of our work 
include a strongly polynomial time algorithm for convex and linear combinatorial opti- 
mization over any family presented by a membership oracle when the underlying polytope 
has few edge-directions; a new theory of so-termed n-fold integer programming, yielding 
polynomial time solution of important and natural classes of convex and linear integer 
programming problems in variable dimension; and a complete complexity classification 
of high dimensional transportation problems, with practical applications to fundamental 
problems in privacy and confidential statistical data disclosure. 
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1 Introduction 



The general linear discrete optimization problem can be posed as follows. 

Linear discrete optimization. Given a set 5 C Z" of integer points and an integer 
vector t(7 G Z", find an x E S maximizing the standard inner product wx := Y17=i '^i^i- 

The algorithmic complexity of this problem, which includes integer programming and 
combinatorial optimization as special cases, depends on the presentation of the set S of 
feasible points. In integer programming, this set is presented as the set of integer points 
satisfying a given system of linear inequalities, which in standard form is given by 

S = {xeW : Ax = h} , 

where N stands for the nonnegative integers, A G Z"^^" is an m x n integer matrix, 
and h G Z™ is an integer vector. The input for the problem then consists of A^h^w. 
In combinatorial optimization, S C {0, 1}" is a set of {0, l}-vectors, often interpreted as 
a family of subsets of a ground set N :— {1, . . . ,n}, where each x E S is the indicator 
of its support supp(a;) C N. The set S is presented implicitly and compactly, say as 
the set of indicators of subsets of edges in a graph G satisfying a given combinatorial 
property (such as being a matching, a forest, and so on), in which case the input is G,w. 
Alternatively, S is given by an oracle, such as a membership oracle which, queried on 
X G {0, l}*^', asserts whether or not x E S, in which case the algorithmic complexity also 
includes a count of the number of oracle queries needed to solve the problem. 

Here we study the following broad generalization of linear discrete optimization. 

Convex discrete optimization. Given a set 5 C Z", vectors wi, . . . E Z", and a 
convex functional c : M.^ — > M, find an x E S maximizing c{wix, . . . , Wdx). 

This problem can be interpreted as multi-objective linear discrete optimization: given d 
linear functionals WiX, . . . , WdX representing the values of points x E S under d criteria, 
the goal is to maximize their "convex balancing" defined by c{wix, . . . ,Wdx). In fact, we 
have a hierarchy of problems of increasing generality and complexity, parameterized by 
the number d of linear functionals: at the bottom lies the linear discrete optimization 
problem, recovered as the special case of d = 1 and c the identity on M; and at the top lies 
the problem of maximizing an arbitrary convex functional over the feasible set S, arising 
with d = n and with Wj = 1, the i-th standard unit vector in M" for all i. 

The algorithmic complexity of the convex discrete optimization problem depends on 
the presentation of the set S of feasible points as in the linear well as on the 

presentation of the convex functional c. When S is presented as the set of integer points 
satisfying a given system of linear inequalities we also refer to the problem as convex 
integer programming, and when S C {0, 1}" and is presented implicitly or by an oracle 
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we also refer to the problem as convex combinatorial optimization. As for the convex 
functional c, we will assume throughout that it is presented by a com,parison oracle that, 
queried onx,y & M.'^, asserts whether or not c{x) < c{y). This is a very broad presentation 
that reveals little information on the function, making the problem, on the one hand, very 
expressive and applicable, but on the other hand, very hard to solve. 

There is a massive body of knowledge on the complexity of linear discrete optimization 
- in particular (linear) integer programming [55] and (linear) combinatorial optimization 
[31]. The purpose of this monograph is to provide the first comprehensive unified treat- 
ment of the extended convex discrete optimization problem. The monograph follows the 
outline of five lectures given by the author in the Seminaire de Mathematiques Superieures 
Series, Universite dc Montreal, during June 2006. Colorful slides of theses lectures are 
available online at [46] and can be used as a visual supplement to this monograph. The 
monograph has been written under the support of the ISF - Israel Science Foundation. 
The theory developed here is based on and is a culmination of several recent papers in- 
cluding [5, 12, 13, 14, 15, 16, 17, 25, 39, 47, 48, 49, 50, 51] written in collaboration with 
several colleagues - Eric Babson, Jesus De Loera, Komei Pukuda, Raymond Hemmecke, 
Frank Hwang, Vera Rosta, Uriel Rothblum, Leonard Schulman, Bernd Sturmfels, Rekha 
Thomas, and Robert Weismantel. By developing and using a combination of geomet- 
ric and algebraic tools, we are able to provide polynomial time algorithms for several 
broad classes of convex discrete optimization problems. We also discuss in detail some of 
the many applications of our theory, including to quadratic programming, matroids, bin 
packing and cutting-stock problems, vector partitioning and clustering, multiway trans- 
portation problems, and privacy and confidential statistical data disclosure. 

We hope that this monograph will, on the one hand, allow users of discrete optimiza- 
tion to enjoy the new powerful modelling and expressive capability of convex discrete 
optimization along with its broad polynomial time solvability, and on the other hand, 
stimulate more research on this new and fascinating class of problems, their complexity, 
and the study of various relaxations, bounds, and approximations for such problems. 

1.1 Limitations 

Convex discrete optimization is generally intractable even for small fixed d, since already 
for (i = 1 it includes linear integer programming which is NP-hard. When is a variable 
part of the input, even very simple special cases are NP-hard, such as the following 
problem, so-called positive semi-definite quadratic binary programming, 

max {(wix)^ -|- • — h {wnxy : x e N" , Xi < 1 , i — 1, . . . ,n} . 

Therefore, throughout this monograph we will assume that d is fixed (but arbitrary). 
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As explained above, we also assume throughout that the convex functional c which 
constitutes part of the data for the convex discrete optimization problem is presented by 
a comparison oracle. Under such broad presentation, the problem is generally very hard. 
In particular, if the feasible set is := {a; G : Ax = b} and the underlying polyhedron 
P :— {x & : Ax — b} is unbounded, then the problem is inaccessible even in one 
variable with no equation constraints. Indeed, consider the following family of univariate 
convex integer programs with convex functions parameterized by — oo < w < oo. 



Consider any algorithm attempting to solve the problem and let u be the maximum value 
of X in all queries to the oracle of c. Then the algorithm can not distinguish between 
the problem with c„, whose objective function is unbounded, and the problem with Coo, 
whose optimal objective value is 0. Thus, convex discrete optimization (with an oracle 
presented functional) over an infinite set S C Z" is quite hopeless. Therefore, an algo- 
rithm that solves the convex discrete optimization problem will either return an optimal 
solution, or assert that the problem is infeasible, or assert that the underlying polyhedron 
is unbounded. In fact, in most applications, such as in combinatorial optimization with 
S ^ {0,1}" or integer programming with S* := {x G Z" : Ax = b, I < x < u} and 
/, M G Z**, the set S is finite and the problem of unboundedness does not arise. 

1.2 Outline and Overview of Main Results and Applications 

We now outline the structure of this monograph and provide a brief overview of what we 
consider to be our main results and main applications. The precise relevant definitions 
and statements of the theorems and corollaries mentioned here are provided in the rel- 
evant sections in the monograph body. As mentioned above, most of these results are 
adaptations or extensions of results from one of the papers [5, 12, 13, 14, 15, 16, 17, 25, 
39, 47, 48, 49, 50, 51]. The monograph gives many more applications and results that may 
turn out to be useful in future development of the theory of convex discrete optimization. 

The rest of the monograph consists of five sections. While the results evolve from 
one section to the next, it is quite easy to read the sections independently of each other 
(while just browsing now and then for relevant definitions and results). Specifically, 
Section 3 uses definitions and the main result of Section 2; Section 5 uses definitions and 
results from Sections 2 and 4; and Section 6 uses the main results of Sections 4 and 5. 

In Section 2 we show how to reduce the convex discrete optimization problem over 
S C Z" to strongly polynomially many linear discrete optimization counterparts over S, 
provided that the convex hull conv(5') satisfies a suitable geometric condition, as follows. 




—X, ii X < u; 
X — 2u, ii X >u. 
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Theorem 2.4 For every fixed d, the convex discrete optimization problem over any finite 
S dWr' presented by a linear discrete optimization oracle and endowed with a set covering 
all edge-directions of cony^S) , can be solved in strongly polynomial time. 

This resuh will be incorporated in the polynomial time algorithms for convex combinato- 
rial optimization and convex integer programming to be developed in §3 and §5. 

In Section 3 we discuss convex combinatorial optimization. The main result is that 
convex combinatorial optimization over a set S C {0, 1}" presented by a membership ora- 
cle can be solved in strongly polynomial time provided it is endowed with a set covering all 
edge-directions of conv(5'). In particular, the standard linear combinatorial optimization 
problem over S can be solved in strongly polynomial time as well. 

Theorem 3.5 For every fixed d, the convex combinatorial optimization problem over 
any S C {0, 1}" presented by a membership oracle and endowed with a set covering all 
edge- directions of the polytope conv(5'), can he solved in strongly polynomial time. 

An important application of Theorem 3.5 concerns convex matroid optimization. 

Corollary 3.11 For every fi,xed d, convex combinatorial optimization over the family of 
bases of a matroid presented by membership oracle is strongly polynomial time solvable. 

In Section 4 we develop the theory of linear n-fold integer programming. As a conse- 
quence of this theory we are able to solve a broad class of linear integer programming prob- 
lems in variable dimension in polynomial time, in contrast with the general intractability 
of linear integer programming. The main theorem here may seem a bit technical at a first 
glance, but is really very natural and has many applications discussed in detail in §4, §5 
and §6. To state it we need a definition. Given an (r + s) x t matrix A, let Ai be its 
r xt sub-matrix consisting of the first r rows and let A2 be its s x t sub-matrix consisting 
of the last s rows. We refer to A explicitly as (r -|- s) x i matrix., since the definition 
below depends also on r and s and not only on the entries of A. The n-fold matrix of an 
(r -|- s) xt matrix A is then defined to be the following (r -|- ns) x nt matrix. 



(i„ ® Ai) e {In ® A2) 



(A, 


A, 


A, ■ 









• 


■ 





A2 


• 


• 



v 







A2 ) 

= Zl±l{±oo}, right- 



Given now any n G N, lower and upper bounds Z, w e with 
hand side h G Z''^"*. and linear functional wx with w G Z"*, the corresponding linear 
n-fold integer programming problem is the following program in variable dimension nt, 

max{wx : x G lT-\ A^'^^x = 6, l<x<u\ . 
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The main theorem of §4 asserts that such integer programs are polynomial time solvable. 

Theorem 4.11 For every fixed (r + s) x i integer matrix A, the linear n-fold integer 
programming problem with any n, I, u, h, and w can he solved in polynomial time. 

Theorem 4.11 has very important applications to high-dimensional transportation prob- 
lems which are discussed in §4.5.1 and in more detail in §6. Another major application 
concerns bin packing problems, where items of several types are to be packed into bins so 
as to maximize packing utility subject to weight constraints. This includes as a special 
case the classical cutting-stock problem of [27]. These are discussed in detail in §4.5.2. 

Corollary 4.15 For every fixed number t of types and type weights Vi, . . . , Vt, the corre- 
sponding integer bin packing and cutting-stock problems are polynomial time solvable. 

In Section 5 we discuss convex integer programming, where the feasible set S is 
presented as the set of integer points satisfying a given system of linear inequalities. 

In particular, we consider convex integer programming over n-fold systems for any fixed 
(but arbitrary) (r + s) x t matrix A, where, given n G N, vectors /, m e Z^, b G U^'^^ and 
Wi, . . . , G Z"*, and convex functional c : M*^ — >• M, the problem is 

max{c(wix, . . . , w^x) : x G Z"**, A^'^^x = 6, I < x < u} . 

The main theorem of §5 is the following extension of Theorem 4.11, asserting that convex 
integer programming over n-fold systems is polynomial time solvable as well. 

Theorem 5.5 For every fixed d and {r -\- s) x t integer matrix A, convex n-fold integer 
programming with any n, I, u, b, wi, . . . , wa, and c can be solved in polynomial time. 

Theorem 5.5 broadly extends the class of objective functions that can be efficiently max- 
imized over n-fold systems. Thus, all applications discussed in §4.5 automatically extend 
accordingly. These inchidc convex high-dimensional transportation problems and convex 
bin packing and cutting-stock problems, which are discussed in detail in §5.4.1 and §6. 

Another important application of Theorem 5.5 concerns vector partitioning problems 
which have applications in many areas including load balancing, circuit layout, ranking, 
cluster analysis, inventory, and rehabihty, see e.g. [7, 9, 25, 39, 50] and the references 
therein. The problem is to partition n items among p players so as to maximize social 
utility. With each item is associated a A;-dimensional vector representing its utility under 
k criteria. The social utility of a partition is a convex function of the sums of vectors 
of items that each player receives. In the constrained version of the problem, there are 
also restrictions on the number of items each player can receive. We have the following 
consequence of Theorem 5.5; more details on this application are in §5.4.2. 

CoroIIciry 5.10 For every fixed number p of players and number k of criteria, the con- 
strained and unconstrained vector partition problems with any item vectors, convex utility. 
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and constraints on the number of item per player, are polynomial time solvable. 

In the last Section 6 we discuss multiway (high-dimensional) transportation problems 

and secure statistical data disclosure. Multiway transportation problems form a very 
important class of discrete optimization problems and have been used and studied exten- 
sively in the operations research and mathematical programming literature, as well as in 
the statistics literature in the context of secure statistical data disclosure and management 
by public agencies, see e.g. [4, 6, 11, 18, 19, 42, 43, 53, 60, 62] and the references therein. 
The feasible points in a transportation problem are the multiway tables ("contingency 
tables" in statistics) such that the sums of entries over some of their lower dimensional 
sub-tables such as lines or planes ("margins" in statistics) are specified. We completely 
settle the algorithmic complexity of treating multiway tables and discuss the applications 
to transportation problems and secure statistical data disclosure, as follows. 

In §6.2 we show that "short" 3- way transportation problems, over r x c x 3 tables with 
variable number r of rows and variable number c of columns but fixed small number 3 of 
layers (hence "short"), are universal in that every integer programming problem is such 
a problem (see §6.2 for the precise stronger statement and for more details). 

Theorem 6.1 Every linear integer programming problem m&x{cy : y e N" : Ay = b} is 
polynomial time representable as a short 3-way line-sum transportation problem 

max{ : x e W'"'''^ : ^ Xij^k = , ^ Xij^k = , ^'-^^ ^ '^''^ ^ ' 

i j k 

In §6.3 we discuss A;-way transportation problems of any dimension k. We provide the 
first polynomial time algorithm for convex and linear "long" (k + l)-way transportation 
problems, over mi x ■ ■ ■ x rrik x n tables, with k and mi, . . . ,mfc fixed (but arbitrary), 
and variable number n of layers (hence "long"). This is best possible in view of Theorem 
6.1. Our algorithm works for any hierarchical collection of margins: this captures common 
margin collections such as all line-sums, all plane-sums, and more generally all /i-flat sums 
for any < h < k (see §6.1 for more details). We point out that even for the very special 
case of linear integer transportation over 3 x 3 x n tables with specified line-sums, our 
polynomial time algorithm is the only one known. We prove the following statement. 

Corollary 6.4 For every fixed d, k, mi, . . . , m^ and family T of subsets of {1, . . . , k + 1} 
specifying a hierarchical collection of margins, the convex ( and in particular linear) long 
transportation problem over mi x • • • x x n tables is polynomial time solvable. 

In our last subsection §6.4 we discuss an important application concerning privacy 
in statistical databases. It is a common practice in the disclosure of a multiway table 
containing sensitive data to release some table margins rather than the table itself. Once 
the margins are released, the security of any specific entry of the table is related to the 
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set of possible values that can occur in that entry in any table having the same margins 
as those of the source table in the data base. In particular, if this set consists of a 
unique value, that of the source table, then this entry can be exposed and security can be 
violated. We show that for multiway tables where one category is significantly richer than 
the others, that is, when each sample point can take many values in one category and only 
few values in the other categories, it is possible to check entry-uniqueness in polynomial 
time, allowing disclosing agencies to make learned decisions on secure disclosure. 

Corollary 6.6 For every fixed k, mi, . . . , ruk and family T of subsets of {1, . . . , k + 1} 

specifying a hierarchical collection of margins to be disclosed, it can be decided in polyno- 
mial time whether any specified entry a;j^,...,ij.^^ is the same in all long mi x • • • x m^ x n 
tables with the disclosed margins, and hence at risk of exposure. 

1.3 Terminology and Complexity 

We use M for the reals, R_|_ for the nonnegative reals, Z for the integers, and N for the 
nonnegative integers. The sign of a real number r is denoted by sign(r) G {0, —1, 1} and 
its absolute value is denoted by |r|. The i-th standard unit vector in is denoted by Ij. 
The support of a; G M" is the index set supp(x) := {i : x,j ^ 0} of nonzero entries of x. The 
indicator of a subset / C {0, 1}" is the vector 1/ := Yli&i ^'^ ^^at supp(l/) = /. When 
several vectors are indexed by subscripts, Wi,...,Wii G M", their entries are indicated 
by pairs of subscripts, Wi = (wj^i, . . . , When vectors are indexed by superscripts, 

x^,. . . ,x'' G M", their entries are indicated by subscripts, = {x\, . . . , x^). The integer 
lattice Z" is naturally embedded in R". The space M"^ is endowed with the standard 
inner product which, for w,x E M", is given by wx := Yl^=i'^i^i- Vectors w in M" will 
also be regarded as linear functionals on M" via the inner product wx. Thus, we refer to 
elements of as points, vectors, or linear functionals, as will be appropriate from the 
context. The convex hull of a set -S" C R" is denoted by conv(5') and the set of vertices of 
a polyhedron P C R" is denoted by vcrt(P). In linear discrete optimization over S C Z"', 
the facets of conv(S') play an important role, see Chvatal [10] and the references therein 
for earlier work, and Grotschel, Lovasz and Schrijver [31, 45] for the later culmination in 
the equivalence of separation and linear optimization via the ellipsoid method of Yudin 
and Nemirovskii [63]. As will turn out in §2, in convex discrete optimization over S, the 
edges of conv(S') play an important role (most significantly in a way which is not related 
to the Hirsch conjecture discussed in [41]). We therefore use extensively convex polytopes, 
for which we follow the terminology of [32, 65]. 

We often assume that the feasible set S C Z^' is finite. We then define its radius to be 
its loo radius p{S) :— max{||a:;||oo : x & S} where, as usual, || \. In other 

words, p{S) is the smallest p G N such that S is contained in the cube [— p, p]". 

Our algorithms are applied to rational data only, and the time complexity is as in 
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the standard Turing machine model, see e.g. [1, 26, 55]. The input typicaUy consists of 
rational (usually integer) numbers, vectors, matrices, and finite sets of such objects. The 
binary length of an integer number 2; G Z is defined to be the number of bits in its binary 
representation, (z) := 1 + flog2(|-2| + 1)] (with the extra bit for the sign). The length of 
a rational number presented as a fraction r — ^ with p,q & 'Z is (r) := (p) + (g). The 
length of an m X n matrix A (and in particular of a vector) is the sum (A) :— 
of the lengths of its entries. Note that the length of A is no smaller than the number 
of entries, (A) > mn. Therefore, when A is, say, part of an input to an algorithm, 
with m,n variable, the length (A) already incorporates mn, and so we will typically not 
account additionally for m, n directly. But sometimes, especially in results related to 
n-fold integer programming, we will also emphasize n as part of the input length. Simi- 
larly, the length of a finite set E of numbers, vectors or matrices is the sum of lengths of 
its elements and hence, since (E) > \E\, automatically accounts for its cardinality. 

Some input numbers affect the running time of some algorithms through their unary 
presentation, resulting in so-called "pseudo polynomial" running time. The unary length 
of an integer number 2; e Z is the number l-zj -|- 1 of bits in its unary representation (again, 
an extra bit for the sign). The unary length of a rational number, vector, matrix, or finite 
set of such objects are defined again as the sums of lengths of their numerical constituents, 
and is again no smaller than the number of such numerical constituents. 

When studying convex and linear integer programming in §4 and §5 we sometimes 
have lower and upper bound vectors l,u with entries in Zoo := Z W {±00}. Both binary 
and unary lengths of a ±00 entry are constant, say 3 by encoding ±00 := ±"00". 

To make the input encoding precise, we introduce the following notation. In ev- 
ery algorithmic statement we describe explicitly the input encoding, by listing in square 
brackets all input objects affecting the running time. Unary encoded objects are listed 
directly whereas binary encoded objects are listed in terms of their length. For example, 
as is often the case, if the input of an algorithm consists of binary encoded vectors (linear 
functional) Wi, . . . , G Z" and unary encoded integer p G N (bounding the radius p{S) 
of the feasible set) then we will indicate that the input is encoded as [p, {wi, . . . , Wd)]. 

Some of our algorithms are strongly polynomial time in the sense of [59]. For this, part 
of the input is regarded as "special" . An algorithm is then strongly polynomial time if it 
is polynomial time in the usual Turing sense with respect to all input, and in addition, 
the number of arithmetic operations (additions, subtractions, multiplications, divisions, 
and comparisons) it performs is polynomial in the special part of the input. To make this 
precise, we extend our input encoding notation above by splitting the square bracketed 
expression indicating the input encoding into a "left" side and a "right" side, separated 
by semicolon, where the entire input is described on the right and the special part of the 
input on the left. For example. Theorem 2.4, asserting that the algorithm underlying it is 
strongly polynomial with data encoded as [n, \ E\; {p{S),Wi, . . . , w^, E)], where p{S) G N, 
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Wi,...,Wfi G Z" and E C Z", means that the running time is polynomial in the binary 
length of p{S), Wi, . . . , w^, and E, and the number of arithmetic operations is polynomial 
in n and the cardinality \E\, which constitute the special part of the input. 

Often, as in [31], part of the input is presented by oracles. Then the running time and 
the number of arithmetic operations count also the number of oracle queries. An oracle 
algorithm is polynomial time if its running time, including the number of oracle queries, 
and the manipulations of numbers, some of which are answers to oracle queries, is polyno- 
mial in the length of the input encoding. An oracle algorithm is strongly polynomial time 
(with specified input encoding as above), if it is polynomial time in the entire input (on 
the "right"), and in addition, the number of arithmetic operations it performs (including 
oracle queries) is polynomial in the special part of the input (on the "left" ) . 
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2 Reducing Convex to Linear Discrete Optimization 



In this section we show that when suitable auxihary geometric information about the 
convex hull conv(5') of a finite set 5" C Z" is available, the convex discrete optimiza- 
tion problem over S can be reduced to the solution of strongly polynomially many linear 
discrete optimization counterparts over S. This result will be incorporated into the poly- 
nomial time algorithms developed in §3 and §5 for convex combinatorial optimization 
and convex integer programming respectively. In §2.1 we provide some preliminaries on 
edge-directions and zonotopes. In §2.2 we prove the reduction which is the main result of 
this section. In §2.3 we prove a pseudo polynomial reduction for any finite set. 

2.1 Edge-Directions and Zonotopes 

We begin with some terminology and facts that play an important role in the sequel. A 
direction of an edge (1-dimensional face) e = [u, v] of a polytope P is any nonzero scalar 
multiple ofu — v. A set of vectors E covers all edge-directions of P if it contains a direction 
of each edge of P. The normal cone of a polytope P C M" at its face F is the (relatively 
open) cone Cp of those linear functionals h e MP- which are maximized over P precisely 
at points of F. A polytope Z is a refinement of a polytope P if the normal cone of every 
vertex of Z is contained in the normal cone of some vertex of P. If Z refines P then, 
moreover, the closure of each normal cone of P is the union of closures of normal cones 
of Z. The zonotope generated by a set of vectors E = {ci, . . . , e^} in M*^ is the following 
polytope, which is the projection by E of the cube [—1, 1]™ into M"*, 

Z := zone{E) := conv |^ : = ±l| C R'^ . 

The following fact goes back to Minkowski, see [32]. 

Lemma 2.1 Let P be a polytope and let E be a finite set that covers all edge-directions 
of P. Then the zonotope Z :— zone{E) generated by E is a refinement of P. 

Proof. Consider any vertex u of Z. Then u — J2eeE "^e^ suitable Ae = ±1. Thus, the 
normal cone consists of those h satisfying hXge > for all e. Pick any h e and 
let V he a vertex of P at which h is maximized over P. Consider any edge [v, w] of P. 
Then v — w = a^^e for some scalar Oe 7^ and some e & E, and < h{v — w) = haeC, 
implying agAg > 0. It follows that every h e satisfies h{v — w) > for every edge of 
P containing v. Therefore h is maximized over P uniquely at v and hence is in the cone 
Cp oi P a,t V. This shows C| C Cp. Since u was arbitrary, it follows that the normal 
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cone of every vertex of Z is contained in the normal cone of some vertex of P. □ 

The next lemma provides bounds on the number of vertices of any zonotope and on 
the algorithmic complexity of constructing its vertices, each vertex along with a linear 
functional maximized over the zonotope uniquely at that vertex. The bound on the 
number of vertices has been rediscovered many times over the years. An early reference 
is [33], stated in the dual form of 2-partitions. A more general treatment is [64]. Recent 
extensions to p-partitions for any p are in [3, 39], and to Minkowski sums of arbitrary 
polytopes are in [29]. Interestingly, already in [33], back in 1967, the question was raised 
about the algorithmic complexity of the problem; this is now settled in [20, 21] (the latter 
reference correcting the former) . We state the precise bounds on the number of vertices 
and arithmetic complexity, but will need later only that for any fixed d the bounds are 
polynomial in the number of generators. Therefore, below we only outline a proof that 
the bounds are polynomial. Complete details are in the above references. 

Lemma 2.2 The number of vertices of any zonotope Z := zone{E) generated by a set 
E of m vectors in M'^ is at most 2 i'"k ^)- ^^"^ every fixed d, there is a strongly 

polynomial time algorithm that, given E C , encoded as [m :— \E\] {E)], outputs every 
vertex v of Z :— zone{E) along with a linear functional /i„ e maximized over Z 
uniquely at v, using 0{m'^~^) arithmetics operations for d > 3 and 0{m'^) for d <2. 

Proof. We only outline a proof that, for every fixed d, the polynomial bounds 0{m'^~^) on 
the number of vertices and 0{m'^) on the arithmetic complexity hold. We assume that E 
linearly spans R*^ (else the dimension can be reduced) and is generic, that is, no d points 
of E lie on a linear hyperplane (one containing the origin). In particular, ^ E. The 
same bound for arbitrary E then follows using a perturbation argument (cf. [39]). 

Each oriented hnear hyperplane H — {x eM.^ : hx — 0} with h eW' nonzero induces 
a partition of E by E ^ //- (+J (+) //+, with H' := {e e E : he < 0}, E^ -.^ E D H, 
and := {e E E : he > 0}. The vertices oi Z = zone(i?) are in bijection with ordered 
2-partitions of E induced by such hyperplanes that avoid E. Indeed, if ii^ = H [+J 
then the linear functional hy := h defining H is maximized over Z uniquely at the vertex 
V := E{e : e G H+} - ^{e : e G H'} of Z. 

We now show how to enumerate all such 2-partitions and hence vertices of Z. Let M 
be any of the (J^^) subsets of E of size d—1. Since E is generic, M is linearly independent 

and spans a unique linear hyperplanc lin(Af). Let H = {x E M.'^ : hx = 0} he one of 
the two orientations of the hyperplane lin(M). Note that = M. Finally, let L be 
any of the 2'^~^ subsets of M. Since M is linearly independent, there is a G M"' which 
linearly separates L from M \ L, namely, satisfies gx < for all x G L and gx > for 
all x G M \ L. Furthermore, there is a sufficiently small e > such that the oriented 
hyperplane H :— {x E M.^ : hx — 0} defined by h :— h + eg avoids E and the 2-partition 
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induced by H satisfies H = [+J L and = [+J(M\ L). The corresponding vertex 
of Z is v := X]{e : e e H^} — ^{e : e G and tlie corresponding linear functional 
which is maximized over Z uniquely dX v \s := h = h + eg. 

We claim that any ordered 2-partition arises that way from some M, some orientation 
H of lin(M), and some L. Indeed, consider any oriented linear hyperplane H avoiding 
E. It can be perturbed to a suitable oriented H that touches precisely d — 1 points of 
E. Put M := so that H coincides with one of the two orientations of the hyperplane 
lin(M) spanned by M, and put L := H fl M . Let H be an oriented hyperplane obtained 
from M, H and L by the above procedure. Then the ordered 2-partition E = H~ [+J 
induced by H coincides with the ordered 2-partition E — H~ l+J H'^ induced by H. 

Since there are (J^^) many (d — l)-subsets MCE, two orientations H of lin(M), 
and 2'^~^ subsets L C M, and d is fixed, the total number of 2-partitions and hence also 
the total number of vertices of Z obey the upper bound 2'^(^™^) = 0(m'^~^). Further- 
more, for each choice of M, H and L, the linear functional h defining H, as well as g, e, 
hy = h = h + eg, and the vertex v = ^{e : e e H~^} — X^{e : e G H~} of Z at which 
hy is uniquely maximized over Z, can all be computed using 0{m) arithmetic operations. 
This shows the claimed bound 0{m^) on the arithmetic complexity. □ 

We conclude with a simple fact about edge-directions of projections of polytopes. 

Lemma 2.3 If E covers all edge-directions of a polytope P, and Q :— oj{P) is the image 
of P under a linear map uj : — > W^, then uj{E) covers all edge- directions of Q. 

Proof. Let / be a direction of an edge [x, y] of Q. Consider the face F :— uj~^{[x, y]) of P. 
Let V be the set of vertices of F and let C/ — {u &V : ijj{u) — x}. Then for some u E U 

and V E V \ U, there must be an edge [u,v] of F, and hence of P. Then lj{v) E {x,y] 
hence uj{v) = x + af for some a 7^ 0. Therefore, with e := ^{v — u), a direction of the 
edge [u,v] of P, we find that / = -{u!{v) — u!{u)) = a;(e) G u}{E). □ 



2.2 Strongly Polynomial Reduction of Convex to Linear Dis- 
crete Optimization 

A linear discrete optimization oracle for a set 5* C Z" is one that, queried on w G Z", 
either returns an optimal solution to the linear discrete optimization problem over S, that 
is, an x* G satisfying wx* — maxima; : a; G 5"}, or asserts that none exists, that is, 
either the problem is infeasible or the objective function is unbounded. We now show that 
a set E covering all edge-directions of the polytope conv(S') underlying a convex discrete 
optimization problem over a finite set S C Z," allows to solve it by solving polynomially 
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many linear discrete optimization counterparts over S. The following theorem extends and 
unifies the corresponding reductions in [49] and [17] for convex combinatorial optimization 
and convex integer programming respectively. Recall from §1.3 that the radius of a finite 
set 5* C is defined to be p{S) := max{|a;i| : x G S, i = 1, . . . , n}. 

Theorem 2.4 For every fixed d there is a strongly polynomial time algorithm that, given 
finite set C Z" presented by a linear discrete optimization oracle, integer vectors 
wi, . . . ,Wd & Z", set E C covering all edge-directions o/conv(S'), and convex functional 
c : — > M presented by a comparison oracle, encoded as [n, \E\; {p{S),wi, . . . , Wd, E)], 
solves the convex discrete optimization problem 

max {c{wix, . . . , Wdx) : x & S} . 

Proof. First, query the linear discrete optimization oracle presenting S on the trivial linear 
functional w = 0. If the oracle asserts that there is no optimal solution then S is empty so 
terminate the algorithm asserting that no optimal solution exists to the convex discrete 
optimization problem either. So assume the problem is feasible. Let P := conv(S') C M" 
and Q := {{wix, . . . ,Wdx) : x G P} C M*^. Then Q is a projection of P, and hence by 
Lemma 2.3 the projection D :— {{wis, . . . , Wde) : e E E} oi the set £■ is a set covering all 
edge-directions of Q. Let Z :— zonc(D) C be the zonotope generated by D. Since d is 
fixed, by Lemma 2.2 we can produce in strongly polynomial time all vertices of Z, every 
vertex v along with a linear functional G maximized over Z uniquely at v. For each 
of these polynomially many h^, repeat the following procedure. Define a vector G Z" 
by :— Yli=i '^i,jhv,i for j' = 1, . . . , n. Now query the linear discrete optimization oracle 
presenting S on the linear functional w :— & Z". Let x^ E S he the optimal solution 
obtained from the oracle, and let := {wiX^, . . . ,WdXy) G Q be its projection. Since 
P = conv(5'), we have that x^ is also a maximizer of g^ over P. Since for every x E P 
and its projection z := {wix, . . . ,Wdx) E Q we have h^,z = g^x, we conclude that z^ is a 
maximizer of over Q. Now we claim that each vertex u oi Q equals some z^. Indeed, 
since Z is a refinement of Q by Lemma 2.1, it follows that there is some vertex v of Z such 
that hy is maximized over Q uniquely at u, and therefore u — Zy. Since c{wiX, . . . ,Wdx) 
is convex on and c is convex on M'', we find that 

max c{wix, . . . , Wdx) — max c{wix, . . . , Wdx) = max c{z) 
xes xep zgq 

— max{c(M) : u vertex of Q} = max{c(2;^) : v vertex of Z} . 

Using the comparison oracle of c, find a vertex v of Z attaining maximum value c{zy), 
and output optimal solution to the convex discrete optimization problem. □ 
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2.3 Pseudo Polynomial Reduction when Edge-Directions are 
not Available 

Theorem 2.4 reduces convex discrete optimization to polynomially many linear discrete 
optimization counterparts when a set covering all edge-directions of the underlying poly- 
topc is available. However, often such a set is not available (see e.g. [8] for the important 
case of bipartite matching). We now show how to reduce convex discrete optimization to 
many linear discrete optimization counterparts when a set covering all edge-directions is 
not offhand available. In the absence of such a set, the problem is much harder, and the 
algorithm below is polynomially bounded only in the unary length of the radius p{S) and 
of the linear functionals wi, . . . , Wd, rather than in their binary length {p{S),wi, . . . , Wd) 
as in the algorithm of Theorem 2.4. Moreover, an upper bound p > p[S) on the radius of 
S is required to be given explicitly in advance as part of the input. 

Theorem 2.5 For every fixed d there is a polynomial time algorithm that, given finite 

set S* C Z" presented by a linear discrete optimization oracle, integer p > p{S), vectors 
Wi, . . . ,Wd G Z", and convex functional c : M'^ — > M presented by a comparison oracle, 
encoded as [p,wi, . . . ,Wd], solves the convex discrete optimization problem 

max {c{wix, . . . , wax) : x & S} . 

Proof. Let P := conv(5') C M", let T := {{wiX, . . . ,Wdx) : x E S} he the projection 
of 5" by Wi, . . . ,Wd, and let Q :— conv(T) C W' be the corresponding projection of P. 
Let r npmaxf^^\\wi\\oo and let G := {— r, . . . , —1,0, 1, . . . ,r}'^. Then T C G and the 
number (2r + 1)*^ of points of G is polynomially bounded in the input as encoded. 

Let D := {u — V : u,v E G,u ^ v} he the set of differences of pairs of distinct point 
of G. It covers all edge-directions of Q since vert((5) <ZG. Moreover, the number of 
points of D is less than (2r -|- 1)^'^ and hence polynomial in the input. Now invoke the al- 
gorithm of Theorem 2.4: while the algorithm requires a set E covering all edge-directions 
of P, it needs E only to compute a set D covering all edge-directions of the projection Q 
(see proof of Theorem 2.4), which here is computed directly. □ 
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3 Convex Combinatorial Optimization and More 



In this section we discuss convex combinatorial optimization. The main result is that con- 
vex combinatorial optimization over a set 5" C {0, 1}" presented by a membership oracle 
can be solved in strongly polynomial time provided it is endowed with a set covering all 
edge-directions of conv(S'). In particular, the standard linear combinatorial optimization 
problem over S can be solved in strongly polynomial time as well. In §3.1 we provide some 
preparatory statements involving various oracle presentation of the feasible set S. In §3.2 
we combine these preparatory statements with Theorem 2.4 and prove the main result of 
this section. An extension to arbitrary finite sets 5" C endowed with edge-directions 
is established in §3.3. Wc conclude with some applications in §3.4. 

As noted in the introduction, when S is contained in {0, 1}" we refer to discrete 
optimization over S also as combinatorial optimization over S, to emphasize that S 
typically represents a family ^ C 2^ of subsets of a ground set :— {1, . . . ,n} pos- 
sessing some combinatorial property of interest (for instance, the family of bases of 
a matroid over N, see §3.4.2). The convex combinatorial optimization problem then 
also has the following interpretation (taken in [47, 49]). We are given a weighting 
uj : A^ — > Tf" of elements of the ground set by d-dimensional integer vectors. We in- 
terpret the weight vector e of element j as representing its value under d criteria 
(e.g., if A" is the set of edges in a network then such criteria may include profit, reliabil- 
ity, flow velocity, etc.). The weight of a subset F C A^ is the sum a;(F) := ^j^pij^ij) 
of weights of its elements, representing the total value of F under the d criteria. Now, 
given a convex functional c : — > M, the objective function value of F C A^ is the 
"convex balancing" c{lu{F)) of the values of the weight vector of F. The convex combi- 
natorial optimization problem is to find a family member F ^ T maximizing c{ijj{F)). 
The usual linear combinatorial optimization problem over T is the special case of ci = 1 
and c the identity on M. To cast a problem of that form in our usual setup just let 
S := {Ip '■ F & JF} C {0, 1}" be the set of indicators of members of T and define weight 
vectors wi, . . . , e by Wij := uj{j)i for i = 1, . . . , d and j = 1, . . . ,n. 

3.1 From Membership to Linear Optimization 

A membership oracle for a set 5' C Z" is one that, queried on a; e Z", asserts whether or 
not X & S. An augmentation oracle for S is one that, queried on x & S and w e Z", either 
returns an x & S with wx > wx, i.e. a better point of S, or asserts that none exists, i.e. 
X is optimal for the linear discrete optimization problem over S. 

A membership oracle presentation of S is very broad and available in all reasonable 
applications, but reveals little information on S, making it hard to use. However, as we 
now show, the edge-directions of conv(S') allow to convert membership to augmentation. 
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Lemma 3.1 There is a strongly polynomial time algorithm that, given set S C {0, 1}" 

presented by a membership oracle, x G S , w G Z", and set E C Z" covering all edge- 
directions of the polytope conv(5'), encoded as [n, \E\; {x,w,E)], either returns a better 
point X G S, that is, one satisfying wx > wx, or asserts that none exists. 

Proof. Each edge of P := conv(S') is the difference of two {0, l}-vectors. Therefore, each 
edge direction of P is, up to scahng, a { — 1,0, l}-vector. Thus, scahng e := ■pj^e and 
e := — e if necessary, we may and will assume that e G (—1,0, 1}"^ and we > for all 
e E E. Now, using the membership oracle, check if there is an e G such that x + e e S 
and we > 0. If there is such an e then output x :— x -\-e which is a better point, whereas 
if there is no such e then terminate asserting that no better point exists. 

Clearly, if the algorithm outputs an x then it is indeed a better point. Conversely, 
suppose X is not a maximizer of w over S. Since S C {0, 1}", the point a; is a vertex of 
P. Since x is not a maximizer of w, there is an edge [x, x] of P with x a vertex satis- 
fying wx > wx. But then e :— x — x is the one {—1, 0, 1} edge-direction of [x, x] with 
we> and hence e G Thus, the algorithm will find and output x — x+e as it should. □ 

An augmentation oracle presentation of a finite S allows to solve the linear discrete 
optimization problem maxima; : x E S} over S by starting from any feasible x E S and 
repeatedly augmenting it until an optimal solution x* & S is reached. The next lemma 
bounds the running time needed to reach optimality using this procedure. While the 
running time is polynomial in the binary length of the linear functional w and the initial 
point X, it is more sensitive to the radius p{S) of the feasible set S, and is polynomial only 
in its unary length. The lemma is an adaptation of a result of [30, 57] (stated therein for 
{0, l}-sets), which makes use of bit-scahng ideas going back to [23]. 

Lemma 3.2 There is a polynomial time algorithm that, given finite set S C presented 
by an augmentation oracle, x E S, and w G Z", encoded as [p{S), {x,w)], provides an 
optimal solution x* E S to the linear discrete optimization problem msix{wz : z E S}. 

Proof. Let k := max"^j^ [loggdwjl + 1)] and note that k < (w). For i = 0, . . . , k define a 
linear functional Ui = (m^,!, . . . ,Mi,n) G Z" by Uij := sign{wj)l2^~''\wj\\ for j = 1, . . . ,n. 
Then uq — 0, Uk — w, and Ui — 2ui-i G {—1, 0, 1}" for all i = 1, . . . , /c. 

We now describe how to construct a sequence of points yo,yi, . . . ,yk £ S such that yi 
is an optimal solution to maxl-Uj?/ : y E S} for all i. First note that all points of S are 
optimal for uq = and hence we can take yo := x to be the point of S given as part of 
the input. We now explain how to determine y^ from for i = 1, . . . , fc. Suppose 
has been determined. Set y :— yi-\. Query the augmentation oracle on. y E S and Ui] if 
the oracle returns a better point y then set y :— y and repeat, whereas if it asserts that 
there is no better point then the optimal solution for Ui is read off to be := y. We now 
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bound the number of calls to the oracle. Each time the oracle is queried on y and Ui and 

returns a better point y, the improvement is by at least one, i.e. Ui{y — y) > 1; this is so 
because Ui, y and y are integer. Thus, the number of necessary augmentations from 
to yi is at most the total improvement, which we claim satisfies 

UiiVi - Vi-i) = (^i - 2ui-i){yi - yi^i) + 2ui-i{yi - yi-i) < 2np + = 2np , 

where p :— p{S). Indeed, Ui — 2ui-i e { — 1,0,1}" and yi,yi-i & S G [— p, p]"^ imply 
{ui - 2ui-i){yi - yi^i) < 2np; and y^-i optimal for Ui-i gives Ui-i{yi - yi^i) < 0. 

Thus, after a total number of at most 2npk calls to the oracle we obtain yk which is 
optimal for u^. Since w = we can output x* := yk as the desired optimal solution to 
the linear discrete optimization problem. Clearly the number 2npk of calls to the oracle, 
as well as the number of arithmetic operations and binary length of numbers occurring 
during the algorithm, are polynomial in p{S), {x,w). This completes the proof. □ 

We conclude this preparatory subsection by recording the following result of [24] which 
incorporates the heavy simultaneous Diophantine approximation of [44]. 

Proposition 3.3 There is a strongly polynomial time algorithm that, given w e Z", 

encoded as [n: {w)], produces w ^IT' , whose binary length (w) is polynomially bounded in 
n and independent of w, and with sign(w2;) — sign(w^) for every z e {—1, 0, 1}". 

3.2 Linear and Convex Combinatorial Optimization in Strongly 
Polynomial Time 

Combining the preparatory statements of §3.1 with Theorem 2.4, we can now solve the 
convex combinatorial optimization over a set S* C {0, 1}" presented by a membership ora- 
cle and endowed with a set covering all edge-directions of conv(5') in strongly polynomial 
time. We start with the special case of linear combinatorial optimization. 

Theorem 3.4 There is a strongly polynomial time algorithm that, given set S" C {0,1}" 
presented by a membership oracle, x & S, w E Z", and set C Z" covering all edge- 
directions of the polytope conv(S'), encoded as [n, \E\; {x,w,E)], provides an optimal so- 
lution X* & S to the linear combinatorial optimization problem max{u'2; : z & S}. 

Proof. First, an augmentation oracle for S can be simulated using the membership oracle, 
in strongly polynomial time, by applying the algorithm of Lemma 3.1 

Next, using the simulated augmentation oracle for S, we can now do linear optimiza- 
tion over S in strongly polynomial time as follows. First, apply to w the algorithm of 
Proposition 3.3 and obtain w e Z" whose binary length (w) is polynomially bounded in 
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n, which satisfies sign{wz) = sign{wz) for every z G { — 1,0, l}". Since S C {0, l}", it is 
finite and has radius p{S) = 1. Now apply the algorithm of Lemma 3.2 to S, x and w, and 
obtain a maximizer x* of w over S. For every y G {0, 1}" we then have — y G { — 1, 0, 1}" 
and hence sign(w(a;* — y)) = sign(w(a;* — y)). So x* is also a maximizer of w over S and 
hence an optimal solution to the given linear combinatorial optimization problem. Now, 
p{S) — 1, (w) is polynomial in n, and x G {0, 1}"^ and hence (x) is linear in n. Thus, the 
entire length of the input [p{S), {x, w)] to the polynomial-time algorithm of Lemma 3.2 is 
polynomial in n, and so its running time is in fact strongly polynomial on that input. □ 

Combining Theorems 2.4 and 3.4 we recover at once the following result of [49]. 

Theorem 3.5 For every fixed d there is a strongly polynomial time algorithm that, given 
set S C {0,1}"" presented by a membership oracle, x E S, vectors ivi. . . . ,Wd G Z", 
set C covering all edge- directions of the polytope conv(5'), and convex functional 
c : — > R presented by a comparison oracle, encoded as [n,\E\; {x,wi, . . . ,Wd, E)], 
provides an optimal solution x* & S to the convex combinatorial optimization problem 

max {c{wiz, . . . , Wdz) : z E S} . 

Proof. Since S is nonempty, a linear discrete optimization oracle for S can be simulated in 
strongly polynomial time by the algorithm of Theorem 3.4. Using this simulated oracle, 
we can apply the algorithm of Theorem 2.4 and solve the given convex combinatorial 
optimization problem in strongly polynomial time. □ 



3.3 Linear and Convex Discrete Optimization over any Set in 
Pseudo Polynomial Time 

In §3.2 above we developed strongly polynomial time algorithms for linear and convex 
discrete optimization over {0, l}-sets. We now provide extensions of these algorithms to 

arbitrary finite sets 5 C Z"^. As can be expected, the algorithms become slower. 

We start by recording the following fundamental result of Khachiyan [40] asserting 
that linear programming is polynomial time solvable via the ellipsoid method [63]. This 
result will be used below as well as several more times later in the monograph. 

Proposition 3.6 There is a polynomial time algorithm that, given A G Z"*^", b G Z'", 
and w G Z"^, encoded as [{A,b,w)], either asserts that P := {x E W"- : Ax < 6} is 
empty, or asserts that the linear functional wx is unbounded over P, or provides a vertex 
V E vert(P) which is an optimal solution to the linear program max{wx : x E P}. 
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The following analog of Lemma 3.1 shows how to covert membership to augmentation 
in polynomial time, albeit, no longer in strongly polynomial time. Here, both the given 
initial point x and the returned better point x if any, are vertices of conv(5'). 

Lemma 3.7 There is a polynomial time algorithm that, given finite set S (ZlP" presented 
by a membership oracle, vertex x of the polytope conv(S'), w G Z", and set C Z" 
covering all edge- directions ci/conv(S'), encoded as [p{S), {x, w, E)], either returns a better 
vertex x of cony^S), that is, one satisfying wx > wx, or asserts that none exists. 

Proof. Dividing each vector e e by the greatest common divisor of its entries and 

setting e := — e if necessary, we can and will assume that each e is primitive, that is, its 
entries are relatively prime integers, and we > 0. Using the membership oracle, construct 
the subset F C E of those e E E for which x + re E S for some r G {1, . . . , 2p{S)}. Let 
G C F be the subset of those / G F for which wf > 0. If G is empty then terminate 
asserting that there is no better vertex. Otherwise, consider the convex cone cone(F) 
generated by F. It is clear that x is incident on an edge of conv(S') in direction / if 
and only if / is an extreme ray of cone(F). Moreover, since G = {f E F : wf > 0} is 
nonempty, there must be an extreme ray of cone(F) which lies in G. Now / G -F is an 
extreme ray of cone(F) if and only if there do not exist nonnegative Ag, e G F \ {/}, such 
that / = ^ejtf^e^'i ^^^^ ^® checked in polynomial time using linear programming. 
Applying this procedure to each f E G, identify an extreme ray g E G. Now, using 
the membership oracle, determine the largest r G {1, . . . ,2^(5*)} for which x + rg E S. 
Output X := X + rg which is a better vertex of conv(5'). □ 

We now prove the extensions of Theorems 3.4 and 3.5 to arbitrary, not necessarily 
{0, l}-valued, finite sets. While the running time remains polynomial in the binary length 
of the weights Wi, . . . ,Wd and the set of edge-directions E, it is more sensitive to the radius 
p{S) of the feasible set S, and is polynomial only in its unary length. Here, the initial 
feasible point and the optimal solution output by the algorithms are vertices of conv(5'). 
Again, we start with the special case of linear combinatorial optimization. 

Theorem 3.8 There is a polynomial time algorithm that, given finite 5 C Z" presented 
by a membership oracle, vertex x of the polytope conv(S'), w E Z", and set F C Z" 
covering all edge- directions of conv{S), encoded as [p{S), {x,w,E)], provides an optimal 
solution X* E S to the linear discrete optimization problem max{wz : z E S}. 

Proof. Apply the algorithm of Lemma 3.2 to the given data. Consider any query x' E S, 
w' E Z" made by that algorithm to an augmentation oracle for S. To answer it, apply 
the algorithm of Lemma 3.7 to x' and w'. Since the first query made by the algorithm 
of Lemma 3.2 is on the given input vertex x' :— x, and any consequent query is on 
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a point x' := X which was the reply of the augmentation oracle to the previous query 
(see proof of Lemma 3.2), we see that the algorithm of Lemma 3.7 will always be asked 
on a vertex of S and reply with another. Thus, the algorithm of Lemma 3.7 can answer 
all augmentation queries and enables the polynomial time solution of the given problem. □ 

Theorem 3.9 For every fixed d there is a polynomial time algorithm that, given finite 
set 5* C Z" presented by membership oracle, vertex x of conv{S) , vectors Wi, . . . ,Wd & , 
set C Z" covering all edge- directions of the polytope conv(5'), and convex functional 
c : W'' — > R presented by a comparison oracle, encoded as [p{S), {x,Wi, . . . ,Wd, E)], 
provides an optimal solution x* & S to the convex combinatorial optimization problem 

max {c{wiz, . . . , Wdz) : z G S} . 

Proof. Since S is nonempty, a linear discrete optimization oracle for S can be simulated 
in polynomial time by the algorithm of Theorem 3.8. Using this simulated oracle, we can 
apply the algorithm of Theorem 2.4 and solve the given problem in polynomial time. □ 



3.4 Some Applications 

3.4.1 Positive Semidefinite Quadratic Binary Programming 

The quadratic binary programming problem is the following: given an nxn matrix M, find 
a vector x e {0, 1}" maximizing the quadratic form x^Mx induced by M. We consider 
here the instance where M is positive semidefinite, in which case it can be assumed to be 
presented as M = W'^W with W a given dx n matrix. Already this restricted version is 
very broad: if the rank d of W and M is variable then, as mentioned in the introduction, 
the problem is NP-hard. We now show that, for fixed d, Theorem 3.5 implies at once that 
the problem is strongly polynomial time solvable (see also [2]). 

Corollary 3.10 For every fixed d there is a strongly polynomial time algorithm that given 
W G Z'^^", encoded as [n; {W)], finds x* e {0, l}" maximizing the form x^W^Wx. 

Proof. Let S := {0, 1}" and let E := {li, . . . , 1„} be the set of unit vectors in M". Then 
P := conv(S') is just the n-cube [0, 1]" and hence E covers all edge-directions of P. A 
membership oracle for S is easily and efficiently realizable and a; := G S* is an initial 
point. Also, \E\ and (E) are polynomial in n, and E is easily and efficiently computable. 

Now, for i — 1, . . . , define Wi G Z" to be the i-th row of the matrix W, that is, 
Wij := Wij for all i.j. Finally, let c : — > M be the squared I2 norm given by 
c{y) :— \\y\\l '■— X^j=i2/|, and note that the comparison of c{y) and c{z) can be done for 
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y,z & X'^ in time polynomial in (y, z) using a constant number of arithmetic operations, 
providing a strongly polynomial time realization of a comparison oracle for c. 

This translates the given quadratic programming problem into a convex combinatorial 
optimization problem over S, which can be solved in strongly polynomial time by apply- 
ing the algorithm of Theorem 3.5 to x — 0, wi, . . . , wa, E, and c. □ 



3.4.2 Matroids and Mciximum Norm Spanning Trees 

Optimization problems over matroids form a fundamental class of combinatorial opti- 
mization problems. Here we discuss matroid bases, but everything works for independent 
sets as well. Recall that a family B of subsets of {!,... .n} is the family of bases of a 
matroid if all members of B have the same cardinality, called the rank of the matroid, 
and for every B,B' e B and i e B \ B' there is a j e B' such that B \ {i} U {j} e B. 
Useful models include the graphic matroid of a graph G with edge set {1, . . . , n} and B 
the family of spanning forests of G, and the linear matroid of an m x n matrix A with B 
the family of sets of indices of maximal linearly independent subsets of columns of A. 

It is well known that linear combinatorial optimization over matroids can be solved 
by the fast greedy algorithm [22]. We now show that, as a consequence of Theorem 3.5, 
convex combinatorial optimization over a matroid presented by a membership oracle can 
be solved in strongly polynomial time as well (see also [34, 47]). We state the result 
for bases, but the analogous statement for independent sets hold as well. We say that 
S C {0, l}*^ is the set of bases of a matroid if it is the set of indicators of the family B of 
bases of some matroid, in which case we call conv(5') the matroid base polytope. 

Corollary 3.11 For every fixed d there is a strongly polynomial time algorithm that, 
given set S C {0, 1}" of bases of a matroid presented by a membership oracle, x E S, 
Wi, . . . ,W(i e Z", and convex functional c : R*^ — > M presented by a comparison oracle, 
encoded as [n; {x,Wi, . . . , wa)], solves the convex matroid optimization problem 

max {c{wiz, . . . , Wdz) : z & S} . 

Proof. Let E :— {Ij — 1^ : 1 < i < j < n} be the set of differences of pairs of unit 
vectors in M". We claim that E covers all edge- directions of the matroid base polytope 
P := conv(S'). Consider any edge e = [y, y'] of P with y,y' E S and let B := supp(|/) and 
B' := supp(7/') be the corresponding bases. Let h G M** be a linear functional uniquely 
maximized over P at e. If B \ B' = {i} is a singleton then B' \ B = {j} is a singleton 
as well in which case y — y' — li — Ij and we are done. Suppose then, indirectly, that it 
is not, and pick an element i in the symmetric difference BAB' := {B \ B') U {B' \ B) 
of minimum value hi. Without loss of generality assume i e B \ B' . Then there is a 
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j E B' \ B such that B" := B\ {i} U {j} is also a basis. Let y" G 5 be the indicator of 
B" . Now \Bl^B'\ > 2 imphes that B" is neither B nor B'. By the choice of i we have 
hy" = hy — hi + hj > hy. So y" is also a maximizer of /i over P and hence € e. But 
no {0, l}-vector is a convex combination of others, a contradiction. 

Now, l^l = (2) and E C {—1,0,1}" imply that and {E) are polynomial in n. 
Moreover, E can be easily computed in strongly polynomial time. Therefore, applying 
the algorithm of Theorem 3.5 to the given data and the set E, the convex discrete opti- 
mization problem over S can be solved in strongly polynomial time. □ 

One important application of Corollary 3.11 is a polynomial time algorithm for com- 
puting the universal Grobner basis of any system of polynomials having a finite set of 

common zeros in fixed (but arbitrary) number of variables, as well as the construction of 
the state polyhedron of any member of the Hilbert scheme, see [5, 51]. Other important ap- 
plications are in the field of algebraic statistics [52], in particular for optimal experimental 
design. These applications are beyond our scope here and will be discussed elsewhere. 

Here is another concrete example of a convex matroid optimization application. 

Example 3.12 (maximum norm spanning tree). Fix any positive integer d. Let 

II ■ lip : R'^ — > M be the Ip norm given by ||x||p := (XliLi k«|^)^ for 1 < p < cxd and 
Halloo := max^^;^ \xi\. Let G be a connected graph with edge set N := {1, . . . ,n}. For 
J = 1, . . . , n let e Z*^ be a weight vector representing the values of edge j under some d 
criteria. The weight of a subset T C N is the sum ^j^^ '"i representing the total values 
of T under the d criteria. The problem is to find a spanning tree T of G whose weight 
has maximum Ip norm, that is, a spanning tree T maximizing || Xljer'^i Hp- 
Define . . . , G Z" by Wij := Uj^i for i = 1, . . . , d, j = 1, . . . ,n. Let S C {0, 1}" be 
the set of indicators of spanning trees of G. Then, in time polynomial in n, a membership 
oracle for S is realizable, and an initial x & S is obtainable as the indicator of any greedily 
constructible spanning tree T. Finally, define the convex functional c := || • Then for 
most common values p = 1, 2, 00, and in fact for any p G N, the comparison of c{y) and 
c{z) can be done for y, z E 'E'^ in time polynomial in (y, z,p) by computing and comparing 
the integer valued p-th powers \\y\\p and ||-2||p. Thus, by Corollary 3.11, this problem is 
solvable in time polynomial in {ui, . . . , Un,p)- 
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4 Linear N-fold Integer Programming 



In this section we develop a theory of hnear n-fold integer programming, which leads to 
the polynomial time solution of broad classes of linear integer programming problems in 
variable dimension. This will be extended to convex n-fold integer programming in §5. 

In §4.1 wc describe an adaptation of a result of [56] involving an oriented version of 
the augmentation oracle of §3.1. In §4.2 we discuss Graver bases and their application 
to linear integer programming. In §4.3 we show that Graver bases of n-fold matrices can 
be computed efficiently. In §4.4 we combine the preparatory statements from §4.1, §4.2, 
and §4.3, and prove the main result of this section, asserting that linear n-fold integer 
programming is polynomial time solvable. Wc conclude with some applications in §4.5. 

Here and in §5 we concentrate on discrete optimization problems over a set S presented 
as the set of integer points satisfying an explicitly given system of linear inequalities. 
Without loss of generality we may and will assume that S is given either in standard form 
,5 := {x e : Ax^ b} where A e Z"*^" and b e Z"*, or in the form 

S := {x eZ"" : Ax = b, I <x <u} 

where l,u E Z^ and = Z l±l {±oo}, where some of the variables are bounded below 
or above and some are unbounded. Thus, S is no longer presented by an oracle, but by 
the explicit data A,b and possibly l,u. In this setup wc refer to discrete optimization 
over S also as integer programming over S. As usual, an algorithm solving the problem 
must either provide an x & S maximizing wx over S, or assert that none exists (either 
because S is empty or because the objective function is unbounded over the underlying 
polyhedron). We will sometimes assume that an initial point x & S is given, in which 
case b will be computed as b :— Ax and not be part of the input. 

4.1 Oriented Augmentation and Linear Optimization 

We have seen in §3.1 that an augmentation oracle presentation of a finite set 5* C Z" 
enables to solve the linear discrete optimization problem over S. However, the running 
time of the algorithm of Lemma 3.2 which demonstrated this, was polynomial in the unary 
length of the radius p{S) of the feasible set rather than in its binary length. 

In this subsection we discuss a recent result of [56] and show that, when S is presented 
by a suitable stronger oriented version of the augmentation oracle, the linear optimization 
problem can be solved by a much faster algorithm, whose running time is in fact poly- 
nomial in the binary length {p{S)). The key idea behind this algorithm is that it gives 
preference to augmentations along interior points of conv(5') staying far off its boundary. 
It is inspired by and extends the combinatorial interior point algorithm of [61]. 
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For any vector g eW^, let g'^,g~ G denote its positive and negative parts, defined 
by '■= Taax{gj, 0} and g^ := — min{gj, 0} for j = 1, . . . , n. Note that both g~^, g~ are 
nonnegativc, supp((y') = supp((?+) l+J supp((y'^), and g = g^ — g~ ■ 

An oriented augmentation oracle for a set C is one that, queried on a; G 5* and 
Wj^,W- e Z", either returns an augmenting vector g e Z", defined to be one satisfying 
X -\- g & S and Wj^g'^ — w_g~ > 0, or asserts that none exists. 

Note that this oracle involves two linear functionals w^,w_ G Z" rather than one 
{w+,w^ are two distinct independent vectors and not the positive and negative parts 
of one vector). The conditions on an augmenting vector g indicate that it is a feasible 
direction and has positive value under the nonlinear objective function determined by 
w+,w^. Note that this oracle is indeed stronger than the augmentation oracle of §3.1: 
to answer a query a; G S", w G Z" to the latter, set w+ := w_ := w, thereby obtaining 
w+g^ — W-g~ = wg for all g, and query the former on x, if it replies with an 

augmenting vector g then reply with the better point x := x + g, whereas if it asserts that 
no g exists then assert that no better point exists. 

The following lemma is an adaptation of the result of [56] concerning sets of the form 
S* := {a; G Z" : Ax = b, < a; < -u} of nonnegative integer points satisfying equations 
and upper bounds. However, the pair A, b is neither explicitly needed nor does it affect 
the running time of the algorithm underlying the lemma. It suffices that S is of that form. 
Moreover, an arbitrary lower bound vector I rather than can be included. So it suffices 
to assume that -S" coincides with the intersection of its affine hull and the set of integer 
points in a box, that is, S = aff(S') D {x E Z'^ : I < x < u} where l,u E Z". We now 
describe and prove the algorithm of [56] adjusted to any lower and upper bounds l,u. 

Lemma 4.1 There is a polynomial time algorithm that, given vectors l,u E Z", set 

S G satisfying S = afi^(S') fl {2; G Z" : / < 2; < -u} and presented by an oriented 
augmentation oracle, x E S, and w G Z", encoded as [{l,u,x,w)], provides an optimal 
solution X* E S to the linear discrete optimization problem maxjwz : z E S}. 

Proof. Wc start with some strengthening adjustments to the oriented augmentation oracle. 
Let p := max{||/||oo, ||m||oo} be an upper bound on the radius of S. Then any augment- 
ing vector g obtained from the oriented augmentation oracle when queried on y E S 
and W-^,W- G Z", can be made in polynomial time to be exhaustive, that is, to satisfy 
y + 2g ^ S (which means that no longer augmenting step in direction g can be taken). 
Indeed, using binary search, find the largest r G {!,..., 2p} for which I < y + rg < u; 
then S = aff(S') D {z E Z"' : I < z < u} implies y + rg E S and hence we can replace 
g :— rg. So from here on we will assume that if there is an augmenting vector then the 
oracle returns an exhaustive one. Second, let Roo ]Rl±l{±oo} and for any vector i> G 
let E denote its entry-wise reciprocal defined by v^^ := ^ if Vi and :— 00 
if Vi — 0. For any y E S, the vectors (y — and {u — y)~^ are the reciprocals of the 



26 



"entry-wise distance" of y from the given lower and upper bounds. The algorithm will 
query the oracle on triples y, w^, w_ with Wj^ := w — ^{u — y)^^ and w_ := w + n{y — 
where /i is a suitable positive scalar and w is the input linear functional. The fact that 
such w+, W- may have infinite entries does not cause any problem: indeed, if g is an aug- 
menting vector then y + g & S implies that g^ — whenever = Ui and g~ — whenever 
li = yi, so each infinite entry in w+ or occurring in the expression w+g'^ — w^g~ is 
multiplied by and hence zeroed out. 

The algorithm proceeds in phases. Each phase i starts with a feasible point G S 
and performs repeated augmentations using the oriented augmentation oracle, terminating 
with a new feasible point yi & S when no further augmentations are possible. The queries 
to the oracle make use of a positive scalar parameters //j fixed throughout the phase. The 
first phase {i=l) starts with the input point yo := x and sets Hi := p ||w||oo- Each further 
phase i > 2 starts with the point obtained from the previous phase and sets the 
parameter value Hi := to be half its value in the previous phase. The algorithm 

terminates at the end of the first phase i for which /Xj < ^, and outputs x* := yi. Thus, 
the number of phases is at most [log2(2np||u'||oo)l and hence polynomial in {l,u,w). 

We now describe the i-th phase which determines yi from yi-i. Set /ij := and 
ij := yi-i. Iterate the following: query the strengthened oriented augmentation oracle on 
y, w+ := w — Hiiu — y)~^, and W- := w + fii{y — /)~^; if the oracle returns an exhaustive 
augmenting vector g then set y := y + g and repeat, whereas if it asserts that there is no 
augmenting vector then set yi :— y and complete the phase. If //j > ^ then proceed to 
the {i + l)-th phase, else output x* := yi and terminate the algorithm. 

It remains to show that the output of the algorithm is indeed an optimal solution and 
that the number of iterations (and hence calls to the oracle) in each phase is polynomial 
in the input. For this we need the following facts, the easy proofs of which are omitted: 

1. For every feasible y & S and direction g with y + g & S also feasible, we have 

{u-yy^g+ + {y-l)-^g- < n . 

2. For every y & S and direction g with y + g e S but y + 2g ^ S, we have 

{u-y)-'g+ + {y-l)-^g- > ^ . 

3. For every feasible y E S, direction g with y + g E S also feasible, and A* > 0, setting 
w+ :— w — ii{u — y)~^ and w_ :— w -\- iJ,{y — we have 

w+g^-w^g~ = wg - ii{{u- yy^g^ + {y - ly^g-) . 
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Now, consider the last phase i with /Xj < -, let x* :— yi :— y be the output of the 
algorithm at the end of this phase, and let a; G be any optimal solution. Now, the 
phase is completed when the oracle, queried on the triple y, = w — fii{u — y)~^ , and 
w_ = w + iii{y — , asserts that there is no augmenting vector. In particular, setting 
g :— X — y, we find w+g'^ — w^g" < and hence, by facts 1 and 3 above, 

wx — wx* — wg < /li (^{u — y)~^ g'^ + {y — l)"^ g") < —n — 1. 

Since wx and wx* are integer, this implies that in fact wx — wx* < and hence the output 
X* of the algorithm is indeed an optimal solution to the given optimization problem. 

Next we bound the number of iterations in each phase i starting from e S. Let 
again x & S he any optimal solution. Consider any iteration in that phase, where the 
oracle is queried on y, w^ = w — fii{u — y)~^, and w^ = w + iii{y — and returns an 
exhaustive augmenting vector g. We will now show that 

w{y + g)-wy > -^{wx - wyi-i) , (1) 

that is. the increment in the objective value from y to the augmented point ^ + is at 
least ^ times the difference between the optimal objective value wx and the objective 
value wyi_i of the point at the beginning of phase i. This shows that at most 4n 
such increments (and hence iterations) can occur in the phase before it is completed. 

To establish (1), we show that wg > and wx — wyi^i < 2niii. For the first 
inequality, note that g is an exhaustive augmenting vector and so w^g^ — w_g~ > and 
y + 2g ^ S and hence, by facts 2 and 3, wg > jii^iu — y)^^g^ + ~ ^)^^9 ) > 
We proceed with the second inequality. If i = 1 (first phase) then this indeed holds 
since wx — wyo < 2np||w||oo = 2n/xi. If i > 2, let w+ := w — Hi-i{u — |/j_i)~^ and 
w^ :— w + //j_i(yj_i — The (i — l)-th phase was completed when the oracle, queried 
on the triple y^-i, w+, and w_, asserted that there is no augmenting vector. In particular, 
for g :— X — yi-i, we find w+g~^ — W-g" < and so, by facts 1 and 3, 

wx - wyi-i = wg< {{u - yi-iY^g^ + {yi-i - iy^g~)) < Hi-in = 2nfj.i . □ 

4.2 Graver Bases and Linear Integer Programming 

We now come to the definition of a fundamental object introduced by Graver in [28]. The 
Graver basis of an integer matrix A is a canonical finite set Q{A) that can be defined 
as follows. Define a partial order □ on which extends the coordinate-wise order < 
on as follows: for two vectors u,v G put -u □ f and say that u is conform,al to 
V if \ui\ < \vi\ and UiVi > for i = 1, . . . ,n, that is, u and v lie in the same orthant 
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of and each component of u is bounded by the corresponding component of v in 
absolute value. It is not hard to see that C is a well partial ordering (this is basically 
Dickson's lemma) and hence every subset of has finitely-many C-minimal elements. 
Let jC.{A) := {a; G : Ax = 0} be the lattice of linear integer dependencies on A. The 
Graver basis of A is defined to be the set G{A) of all minimal vectors in C{A) \ {0}. 

Note that if A is an m x n matrix then its Graver basis consist of vectors in Z". We 
sometimes write Q{A) as a suitable x n matrix whose rows are the Graver basis 

elements. The Graver basis is centrally symmetric {g G Q{A) implies —g G Q{A)); thus, 
when listing a Graver basis we will typically give one of each antipodal pair and prefix 
the set (or matrix) by ±. Any element of the Graver basis is primitive (its entries are 
relatively prime integers) . Every circuit of A (nonzero primitive minimal support element 
of ^^(^4)) is in Q{A); in fact, if A is totally unimodular then Q{A) coincides with the set 
of circuits (see §5.1 in the sequel for more details on this). However, in general Q{A) is 
much larger. For more details on Graver bases and their connection to Grobner bases see 
Sturmfels [58] and for the currently fastest procedure for computing them see [35, 36]. 

Here is a quick simple example; we will see more structured and complex examples 
later on. Consider the 1x3 matrix A := (1, 2, 1). Then its Graver basis can be shown to be 
the set g{A) = ±{(2, -1, 0), (0, -1, 2), (1, 0, -1), (1,-1, 1)}. The first three elements (and 
their antipodes) are the circuits of A; already in this small example non-circuits appear 
as well: the fourth element (and its antipode) is a primitive linear integer dependency 
whose support is not minimal. 

We now show that when we do have access to the Graver basis, it can be used to solve 
linear integer programming. We will extend this in §5, where we show that the Graver 
basis enables to solve convex integer programming as well. In §4.3 we will show that there 
are important classes of matrices for which the Graver basis is indeed accessible. 

First, we need a simple property of Graver bases. A finite sum u :— Vi of vectors 
Vi G is conformal if each summand is conformal to the sum, that is, ViQu for all i. 

Lemma 4.2 Let A be any integer matrix. Then any h G JC{A) \ {0} can be written as a 
conformal sum h :—'^gi of (not necessarily distinct) Graver basis elements gi G G{A). 

Proof. By induction on the well partial order C. Recall that g{A) is the set of C-minimal 
elements in C{A) \ {0}. Consider any h G C{A) \ {0}. If it is C-minimal then h G g{A) 
and we are done. Otherwise, there is a /i' G G{A) such that h' C h. Set h" :— h — h'. 
Then h" G JC{A) \ {0} and h" C h, so by induction there is a conformal sum h" — g^ 
with gi G Q{A) for all i. Now h — h' + J2i 9i is the desired conformal sum of h. □ 

The next lemma shows the usefulness of Graver bases for oriented augmentation. 
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Lemma 4.3 Let A be an mx n integer matrix with Graver basis Q{A) and let l,u & Z^, 

w^,w_ G Z", and b G Z™. Suppose ,t G T := G Z" : Ay = b,l < y < u}. Then for 
every g G Z" which satisfies x + g E T and Wj^g^ — w^g~ > there exists an element 
g e G{A) with g ^ g which also satisfies x + g E T and w+g'^ — w^g~ > 0. 

Proof. Suppose g eTU^ satisfies the requirements. Then Ag = A{x + g) — Ax = 6 — 6 = 
since x,x + g E T. Thus, g G C{A) \ {0} and hence, by Lemma 4.2, there is a conformal 
sum g = '^j^hi with hi G Q{A) for all i. Now, hi ^ g is equivalent to h^ < g^ and 
< g~, so the conformal sum g — X^i^i gives corresponding sums of the positive and 
negative parts g'^ — and g~ — X] ■ h~ . Therefore we obtain 



which implies that there is some hi in this sum with w+hf — W-h~ > 0. Now, hi G Q{A) 
implies A{x + hi) = Ax = b. Also, I < x,x + g < u and hi ^ g imply that I < x + hi < u. 
So x + hi eT. Therefore the vector g :— hi satisfies the claim. □ 

We can now show that the Graver basis enables to solve linear integer programming 
in polynomial time provided an initial feasible point is available. 

Theorem 4.4 There is a polynomial time algorithm that, given A G Z"*^", its Graver 
basis Q{A), l,u E 11^, G Z" with I < x < u, encoded as [{A,Q{A),l,u,x,w)], solves 
the linear integer program max{u'2; : z G Z", Az — b, I < z < u} with b :— Ax. 

Proof. First, note that the objective function of the integer program is unbounded if and 
only if the objective function of its relaxation max{wy : y G R", Ay — b,l < y < u} is 
unbounded, which can be checked in polynomial time using linear programming. If it is 
unbounded then assert that there is no optimal solution and terminate the algorithm. 

Assume then that the objective is bounded. Then, since the program is feasible, it 
has an optimal solution. Furthermore, (as basically follows from Cramer's rule, see e.g. 
[55, Theorem 17.1]) it has an optimal x* satisfying \xj\ < p for all j, where p is an easily 
computable integer upper bound whose binary length (p) is polynomially bounded in 
{A,l,u,x). For instance, p := {n + l)(n+ l)!r"+^ will do, with r the maximum among 
maxj I AijXjl, maxjj \Aij\, max{|/j| : \lj\ < cxo}, and max{|-Uj| : \uj\ < oo}. 

Let T := {i/ G Z" : Ay = 6, I < y < u} and S := T H [— p, p]". Then our linear 
integer programming problem now reduces to linear discrete optimization over S. Now, 
an oriented augmentation oracle for S can be simulated in polynomial time using the 
given Graver basis G{A) as follows: given a query y E S and w+,w^ G Z", search for 
g e which satisfies w+g~^ — w_g~ > and y + g E S; if there is such a g then return 
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it as an augmenting vector, whereas if there is no such g then assert that no augmenting 
vector exists. Clearly, if this simulated oracle returns a vector g then it is an augmenting 
vector. On the other hand, if there exists an augmenting vector g then y + g & S T 
and w_|_gf+ — W-g~ > imply by Lemma 4.3 that there is also a. g & with g ^ g 

such that W-^g'^ — W-g~ > and y + g & T. Since y,y + g & S and g Q g, we find that 
y + g & S as well. Therefore the Graver basis contains an augmenting vector and hence 
the simulated oracle will find and output one. 

Define /, -u G by Ij := max(/j, —p),Uj := mm(uj, p), j = 1, . . . , n. Then it is easy to 
see that 5* = aff (S*) fl {y e Z" : / < y < «}. Now apply the algorithm of Lemma 4.1 to /, u, 
S, X, and w, using the above simulated oriented augmentation oracle for S, and obtain 
in polynomial time a vector x* & S which is optimal to the linear discrete optimization 
problem over S and hence to the given linear integer program. □ 

As a special case of Theorem 4.4 we recover the following result of [16] concerning 
linear integer programming in standard form when the Graver basis is available. 

Theorem 4.5 There is a polynomial time algorithm that, given matrix A G Z™^", its 
Graver basis Q{A), x G N", and w G Z", encoded as [{A,Q{A),x,w)], solves the linear 
integer programming problem max{wz : z G N", Az — b} where b :— Ax. 

4.3 Graver Bases of N-fold Matrices 

As mentioned above, the Graver basis Q{A) of an integer matrix A contains all circuits 
of A and typically many more elements. While the number of circuits is already typically 
exponential and can be as large as {^i) , the number of Graver basis elements is usually 
even larger and depends also on the entries of A and not only on its dimensions m.n. 
So unfortunately it is typically very hard to compute Q{A). However, we now show that 
for the important and useful broad class of n-fold matrices, the Graver basis is better 
behaved and can be computed in polynomial time. Recall the following definition from 
the introduction. Given an (r + ,s) x t matrix A, let Ai be its r x t sub-matrix consisting 
of the first r rows and let A2 be its s x t sub-matrix consisting of the last s rows. We 
refer to A explicitly as (r -|- s) x t matrix, since the definition below depends also on r 
and s and not only on the entries of A. The n-fold matrix of an (r + s) x t matrix A is 
then defined to be the following (r -|- ns) x nt matrix. 



:= (1„ (8) Ai) © {In A2) 



/A, A, 
A2 
A2 



A, 
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We now discuss a recent result of [54] , which originates in [4] , and its extension in [38] , 
on the stabihzation of Graver bases of n-fold matrices. Consider vectors 
with x'^ G Z* for k = 1, . . . ,n. The type of x is the number \{k : 7^ 0}| of nonzero 
components x^ G of x. The Graver complexity of an (r + s) x t matrix, denoted c{A), 
is defined to be the smallest c G N W {00} such that for all n, the Graver basis of A^") 
consists of vectors of type at most c{A) . We provide the proof of the following result of 
[38, 54] stating that the Graver complexity is always finite. 

Lemma 4.6 The Graver complexity c{A) of any {r + s) x t integer matrix A is finite. 

Proof. Call an element x = {x^,...,x'^) in the Graver basis of some pure if 
a;* G G{A2) for all i. Note that the type of a pure x G Q{A^"^) is n. First, we claim 
that if there is an element of type m in some g{A^^^) then for some n > m there is a pure 
element in Q{A^'^^), and so it will suffice to bound the type of pure elements. Suppose 
there is an element of type m in some Q{A^^^). Then its restriction to its m nonzero 
components is an element x = {x^, . . . , x"^) in Q(A^'^'>). Let x^ = YljLi 9i,j be a conformal 
decomposition of with Qij G G{A2) for all i,j, and let n := ki + ■ ■ ■ + km > rn. Then 
g := ((71,1, . . . , gm,k,n) is in G{A^"'^), else there would be ^ IZ in Q{A^'^^) in which case the 
nonzero x with := X^jLi 9i,j fo^ i would satisfy x \Z x and x G C{A^'^^), contradicting 
X G Q{A^™-^). Thus 5^ is a pure element of type n>m, proving the claim. 

We proceed to bound the type of pure elements. Let ^(^2) = {gi, ■ ■ ■ ,9'm} be the 
Graver basis of A2 and let G2 be the t x m matrix whose columns are the gi. Sup- 
pose X = {x^,...,x^) G Q{A^'^^) is pure for some n. Let v G N™ be the vector with 
Vi := \{k : x^ = gi}\ counting the number of g^ components of x for each i. Then 
Sill is equal to the type n of x. Next, note that A1G2V — ^1(^^=1 x'') = and hence 
V G jC{AiG2). We claim that, moreover, v G ^(^1(^2) ■ Suppose indirectly not. Then there 
is "0 G Q{AiG2) with -0 C f , and it is easy to obtain a nonzero x C x from a; by zeroing out 
some components so that Vi = \{k : x'^ = gi}\ for all i. Then Ai{Y^^=ix'^) — A1G2V — 
and hence x G C{A^"-^), contradicting x G ^(/l^")). 

So the type of any pure element, and hence the Graver complexity of ^4, is at most 
the largest value Y^^i '^i of any nonnegative element v of the Graver basis Q{AiG2). □ 

Using Lemma 4.6 we now show how to compute Q[A^'^^) in polynomial time. 

Theorem 4.7 For every fixed {r + s) x t integer matrix A there is a strongly polyno- 
mial time algorithm that, given n G encoded as [n;n], computes the Graver basis 

Q(A^^^) of the n-f old matrix A^^\ In particular, the cardinality \Q{A'^"-^)\ and binary length 
{Q{A^"'^)) of the Graver basis of the n-fold matrix are polynomially bounded in n. 
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Proof. Let c := c{A) be the Graver complexity of A and consider any n > c. We show 
that the Graver basis of A^"-^ is the union of (") suitably embedded copies of the Graver 
basis of A^^\ For every c indices 1 < ki < ■ ■ ■ < kc < n define a map (pki,...,kc from Z'^ to 
jnt ggnding x = (x^, . . . , x"^) to y = {y^, . . . , y") with := a;* for i = 1, . . . , c and y^ := 
for k ^ {ki, kc}. We claim that ^(A^")) is the union of the images of Q{A^'^'') under 
the (") maps (f)ki,...,kc for all 1 < /ci < • • • < /cc < n, that is, 

GiA^"-^) = U <l^ku...,kASiA^'^)) ■ (2) 

l<fci<---<A;c<n 

If X = (x^, . . . ,x'^ ) G then a; is a C-minimal nonzero element of C(A^'^^), imply- 

ing that <l)ki,...,kc{x) is a C-minimal nonzero element of C{A^^^) and therefore we have 
0fci,...,fcc(^) ^ ^(A^"''). So the right-hand side of (2) is contained in the left-hand side. 
Conversely, consider any y G ^(A*^"^). Then, by Lemma 4.6, the type of y is at most c, 
so there are indices 1 < ki < • • • < kc < n such that all nonzero components of y are 
among those of the reduced vector x := (|/^^, . . . , and therefore y = (j)k^,...,kX^)- Now, 
y G Q{A^^^) implies that y is a C-minimal nonzero element of C{A^^^) and hence a: is a 
□-minimal nonzero clement of C{A^^^). Therefore x G Q{A^'^'') and y G (t)ki,...,kg{Q{A'^'^^)). 
So the left-hand side of (2) is contained in the right-hand side. 

Since A is fixed we have that c = c{A) and Q{A^'^'>) are constant. Then (2) implies that 
< ^0{rf). Moreover, every element of ^(A^'*)) is an nt-dimensional 

vector (j)ki,...,kci^) obtained by appending zero components to some x G Q{A^'^^) and hence 
has linear binary length 0{n). So the binary length of the entire Graver basis ^(A*^"^) is 
(^^j^c+i-j_ Thus, the (") = 0{n'^) images <f>ki,...,kciGiA^''^)) and their union ^(A*^")) can be 
computed in strongly polynomial time, as claimed. □ 



Example 4.8 Consider the (2 + 1) x 2 matrix A with Ai := I2 the 2x2 identity and 
A2 := (1,1). Then ^(^2) = ±(1,-1) and ^(^1^2) = ±(1,1) from which the Graver 
complexity of A can be concluded to be c{A) — 2 (see the proof of Lemma 4.6). The 
2-fold matrix of A and its Graver basis, consisting of two antipodal vectors only, are 
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GiA^^'^) = ±( 1 -1 -1 1 ) 



By Theorem 4.7, the Graver basis of the 4-fold matrix A^^^ is computed to be the union 
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of the images of the 6 = (2) maps (f)ki,k2 '■ — for 1 < ^1 < ^2 < 4, getting 
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4.4 Linear N-fold Integer Programming in Polynomial Time 

We now proceed to provide a polynomial time algorithm for linear integer programming 
over n-fold matrices. First, combining the results of §4.2 and §4.3, we get at once the 
following polynomial time algorithm for converting any feasible point to an optimal one. 

Lemma 4.9 For every fixed {r + s) x t integer matrix A there is a polynomial time 
algorithm that, given n E N, l,u & U^, x,w E Z"* satisfying I < x < u, encoded as 
[(/, M, X, w)], solves the linear n-fold integer programming problem with b :— A^'^^x, 

max{wz : z G Z"*, A'^'^'^z = 6, I < z < u} . 

Proof. First, apply the polynomial time algorithm of Theorem 4.7 and compute the 
Graver basis g{A^'^^) of the n-fold matrix A^"^. Then apply the polynomial time algo- 
rithm of Theorem 4.4 to the data A^''\ ^(^l^^^), I, u, x and w. □ 

Next we show that an initial feasible point can also be found in polynomial time. 

Lemma 4.10 For every fixed {r + s) x t integer matrix A there is a polynomial time 
algorithm that, given n E N, l,u E Z^, and b G Z'""''"^*, encoded as [{l,u,b)], either finds 
an X E IT-^ satisfying I < x <u and A^'^^'x = b or asserts that none exists. 

Proof, li I ^ u then assert that there is no feasible point and terminate the algorithm. 
Assume then that / < u and determine some x G Z"^* with I < x <u and {x) < {l,u). 
Now, introduce n{2r + 2s) auxiliary variables to the given n-fold integer program and 
denote by x the resulting vector of n(t + 2r + 2s) variables. Suitably extend the lower 
and upper bound vectors to m by setting Ij := and Uj :— 00 for each auxiliary variable 
Xj. Consider the auxiliary integer program of finding an integer vector x that minimizes 
the sum of auxiliary variables subject to the lower and upper bounds I < x < u and the 
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following system of equations, with 1^ and Ig the r x r and s x s identity matrices, 
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This is again an n-fold integer program, with an (r + s) x (t + 2r + 2s) matrix A, where 
Ai = {Ai, Ir, —lr,0,0) and A2 = (/I2, 0, 0, J^, — /s). Since A is fixed, so is A. It is now 
easy to extend the vector x e Z"* determined above to a feasible point x of the auxiliary 
program. Indeed, put b :— b — A^'^^x e Z''+"''; now, for i = 1, . . . , r + ns, simply choose 
an auxiliary variable Xj appearing only in the i-th equation, whose coefficient equals the 
sign sign(6j) of the corresponding entry of b, and set Xj := \bi\. Define w G ^"(*+2'-+2s) 
setting w := for each original variable and w := —1 for each auxiliary variable, so that 
maximizing wx is equivalent to minimizing the sum of auxiliary variables. Now solve the 
auxiliary linear integer program in polynomial time by applying the algorithm of Lemma 
4.9 corresponding to A to the data n, I, u, x, and w. Since the auxiliary objective wx is 
bounded above by zero, the algorithm will output an optimal solution x*. If the optimal 
objective value is negative, then the original n-fold program is infeasible, whereas if the 
optimal value is zero, then the restriction of x* to the original variables is a feasible point 
X* of the original integer program. □ 

Combining Lemmas 4.9 and 4.10 we get at once the main result of this section. 

Theorem 4.11 For every fixed {r + s) x t integer matrix A there is a polynomial time 
algorithm that, given n, lower and upper bounds l,u & 71^, w e Z"*, and b e Z''+"*, 
encoded as [{I, u, w, b)], solves the following linear n-fold integer programming problem, 

maxfwx : x e Z"*, A^'^^x = b, I < x < u} . 

Again, as a special case of Theorem 4.11 we recover the following result of [16] con- 
cerning linear integer programming in standard form over n-fold matrices. 

Theorem 4.12 For every fixed {r + s) x t integer matrix A there is a polynomial time 
algorithm that, given n, linear functional w G Z"*, and right-hand side b G 11^'^^ , encoded 
as [(ly, 6)]; solves the following linear n-fold integer program in standard form, 

max{wx : x G N"*, A^'^'^x = b} . 
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4.5 Some Applications 



4.5.1 Three- Way Line-Sum Transportation Problems 

Transportation problems form a very important class of discrete optimization problems 
studied extensively in the operations research and mathematical programming literature, 
see e.g. [6, 42, 43, 53, 60, 62] and the references therein. We will discuss this class of 
problem and its applications to secure statistical data disclosure in more detail in §6. 

It is well known that 2-way transportation problems are polynomial time solvable, 
since they can be encoded as linear integer programs over totally unimodular systems. 
However, already 3-way transportation problem are much more complicated. Consider 
the following 3-way transportation problem over p x q xn tables with all line-sums fixed, 

max{wx : x G N^^^^" , ^Xij^k = Zj,k , ^Xij^k = Vi,k , ^Xij^k = } • 

i j k 

The data for the problem consist of given integer numbers (lines-sums) Uij, Vi^k, for 
i — 1, . . . ,p, j = 1, . . . ,q, k = 1, . . . , n, and a linear functional given by a p x g x n integer 
array w representing the transportation profit per unit on each cell. The problem is to 
find a transportation, that is, a, p x q x n nonnegative integer table x satisfying the line 
sum constraints, which attains maximum profit wx = X]f=i Xl^=i Ylt=i '^i,j,k^i,j,k- 

When at least two of the table sides, say p, q, are variable part of the input, and even 
when the third side is fixed and as small as n = 3, this problem is already universal for 
integer programming in a very strong sense [13, 15], and in particular is NP-hard [12]; 
this will be discussed in detail and proved in §6. Wc now show that in contrast, when 
two sides, say p,q, are fixed (but arbitrary), and one side n is variable, then the 3-way 
transportation problem over such long tables is an n-fold integer programming problem 
and therefore, as a consequence of Theorem 4.12, can be solved is polynomial time. 

Corollary 4.13 For every fixed p and q there is a polynomial time algorithm that, given 
n, integer profit array w e ^px^x^^^ ^ri^d Une-sums u e IF^'^, v e Z^^" and z e Z^^**, 
encoded as [{w,u,v, z)], solves the integer 3-way line-sum transportation problem 

max{wx : x e N^'""''''' , ^ Xij^k = ^j,fe , ^^-•'^ ^ ^^■'^ ' XI ^^-•'^ ^ '^'^^ ^ ' 

i j k 

Proof. Re-index p x q x n arrays as a; = (x^, . . . ,x'^) with each component indexed as 
x^ := {x^ j) '■= {xi^i^k, ■ ■ ■ ,Xp^q^k) suitably indexed as a pq vector representing the k-th 
layer of x. Put r := t := pq and s '■= p + q, and let A be the {r + s) x t matrix with 
^1 '■= Ipq the pq X pq identity and with A2 the {p-\-q) x pq matrix of equations of the usual 
2-way transportation problem for pxq arrays. Re-arrange the given line-sums in a vector 
h := (6°, b\..., h^) e Z'-+"* with 6° := {ui,j) and := ((^;^,fe), (z,- fe)) for A; = 1, . . . , n. 



36 



This translates the given 3-way transportation problem into an n-fold integer pro- 
gramming problem in standard form, 

max {wx : x e N"*, A^'^^x = b} , 

where the equations Ai(Ylk=i^^) ~ represent the constraints X]fc^*j,fc ~ ""jj 
line-sums where summation over layers occurs, and the equations A2x'^ = for k = 
l,...,n represent the constraints '^^^Xij^k = Zj^k and 'Ylij^i,j,k = '^i,k of all line-sums 
where summations are within a single layer at a time. 

Using the algorithm of Theorem 4.12, this n-fold integer program, and hence the given 
3-way transportation problem, can be solved in polynomial time. □ 

Example 4.14 We demonstrate the encoding of the pxqxn transportation problem as 
an n-fold integer program as in the proof of Corollary 4.13 for p = q = 3 (smallest case 
where the problem is genuinely 3-dimensional) . Here we put r :— t := 9, s := 6, write 

x'' :— {Xi^i^k, ^l,2,k, ^l,3,k, ^2,l,k, ^2,2,k, X2,3,k, X3,l,k, X3,2,k, X3,3,k) , k — 1, . . . ,n , 

and let the (9-1-6) x 9 matrix A consist of = Ig the 9x9 identity matrix and 

/lllOOOOOOX 

1 1 1 
. 000000111 
100100100 
1 1 1 

yooiooiooi/ 

Then the corresponding n-fold integer program encodes the 3 x 3 x n transportation 
problem as desired. Already for this case, of 3 x 3 x n tables, the only known polynomial 
time algorithm for the transportation problem is the one underlying Corollary 4.13. 

Corollary 4.13 has a very broad generalization to multiway transportation problems 
over long fc-way tables of any dimension k; this will be discussed in detail in §6. 

4.5.2 Packing Problems and Cutting-Stock 

We consider the following rather general class of packing problems which concern max- 
imum utility packing of many items of several types in various bins subject to weight 
constraints. More precisely, the data is as follows. There are t types of items. Each item 
of type j has integer weight Vj. There are rij items of type j to be packed. There are 
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n bins. The weight capacity of bin k is an integer uj^. Finally, there is a utility matrix 
w G Z*^" where Wj^k is the utility of packing one item of type j in bin k. The problem is to 
find a feasible packing of maximum total utility. By incrementing the number t of types 
by 1 and suitably augmenting the data, we may assume that the last type t represents 
"slack items" which occupy the unused capacity in each bin, where the weight of each 
slack item is 1, the utility of packing any slack item in any bin is 0, and the number of 
slack items is the total residual weight capacity rit := — X]j=i ^i'^i- Let x G N*^" 

be a variable matrix where Xj^k represents the number of items of type j to be packed in 
bin k. Then the packing problem becomes the following linear integer program, 

m.SiX.{wx : X G N*^** , VjXj^k — Uk , Xj^k — '>T'j} ■ 

j k 

We now show that this is in fact an n-fold integer programming problem and therefore, 
as a consequence of Theorem 4.12, can be solved is polynomial time. While the number t 
of types and type weights vj are fixed, which is natural in many bin packing applications, 
the numbers nj of items of each type and the bin capacities Uk may be very large. 

Corollary 4.15 For every fixed number t of types and integer type weights Vi, . . . ,Vt, 
there is a polynomial time algorithm that, given n bins, integer item numbers ni, . . . ,nt, 
integer bin capacities ui,...,Un, and txn integer utility matrix w, encoded as 
[(rii, . . . ^UtiUi, . . . ,Uniw)], solves the following integer bin packing problem, 

m.SiX.{wx : x G M*^'^ , VjXj^k = Uk , Xj^k = ^j} ■ 

j k 

Proof. Re-index the variable matrix as a; = (a;^, . . . , a;") with J ) where a;^ 

represents the number of items of type j to be packed in bin k for al j and k. Let A be the 
{t-\-l)xt matrix with Ai :— It the t identity and with A2 :— {vi, . . . ,vt) a single row. Re- 
arrange the given item numbers and bin capacities in a vector b := {W, b^, . . . , 6") G Z*"*"" 
with W := {ui, . . . ,nt) and 6^ := Uk for all k. This translates the bin packing problem 
into an n-fold integer programming problem in standard form, 

mecK{wx : x G N""*, A^'^^x = b} , 

where the equations Ai(J2^=ix'^) — represent the constraints J2k^j,k — f^j assuring 
that all items of each type are packed, and the equations A2X^ = b^ for k = 1, . . . ,n 
represent the constraints VjXj^k = Uk assuring that the weight capacity of each bin is 
not exceeded (in fact, the slack items make sure each bin is perfectly packed). 

Using the algorithm of Theorem 4.12, this n-fold integer program, and hence the given 
integer bin packing problem, can be solved in polynomial time. □ 
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Example 4.16 (cutting-stock problem). This is a classical manufacturing problem 
[27], where the usual setup is as follows: a manufacturer produces rolls of material (such as 
scotch-tape or band-aid) in one of t different widths Vi, . . . ,Vt. The rolls are cut out from 
standard rolls of common large width u. Given orders by customers for rij rolls of width 
Vj, the problem facing the manufacturer is to meet the orders using the smallest possible 
number of standard rolls. This can be cast as a bin packing problem as follows. Rolls 
of width Vj become items of type j to be packed. Standard rolls become identical bins, 
of capacity Uk := u each, where the number of bins is set to be n := X]j=i T^j/L'^/'^jJl 
which is sufficient to accommodate all orders. The utility of each roll of width vj is set 
to be its width negated Wj^k '■— —Vj regardless of the standard roll k from which it is cut 
(paying for the width it takes). Introduce a new roll width vq :— 1, where rolls of that 
width represent "slack rolls" which occupy the unused width of each standard roll, with 
utility WQ^k '■= —1 regardless of the standard roll k from which it is cut (paying for the 
unused width it represents), with the number of slack rolls set to be the total residual 
width no :— nu — Xlj=i'^i^i- Then the cutting-stock problem becomes a bin packing 
problem and therefore, by Corollary 4.15, for every fixed t and fixed roll widths Vi, . . . ,Vti 
it is solvable in time polynomial in X]j=i T^i/ [""/"^iJl ^.nd (ni, . . . ,nt,u). 

One common approach to the cutting-stock problem uses so-called cutting patterns, 
which are feasible solutions of the knapsack problem {y e N* : Y^j^iVjUj < u}. This is 
useful when the common width u of the standard rolls is of the same order of magnitude 
as the demand role widths Vj. However, when u is much larger than the Vj, the number 
of cutting patterns becomes prohibitively large to handle. But then the values [w/^jj are 
large and hence n := Yl]=i\''^j/['^/'^j\~\ small, in which case the solution through the 
algorithm of Corollary 4. 15 becomes particularly appealing. 
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5 Convex Integer Programming 



In this section we discuss convex integer programming. In particular, we extend the theory 
of §4 and show that convex n-fold integer programming is polynomial time solvable as 
well. In §5.1 we discuss convex integer programming over totally unimodular matrices. 
In §5.2 we show the applicability of Graver bases to convex integer programming. In §5.3 
we combine Theorem 2.4, the results of §4, and the preparatory facts from §§5.2, and 
prove the main result of this section, asserting that convex n-fold integer programming is 
polynomial time solvable. We conclude with some applications in §5.4. 

As in §4, the feasible set S is presented as the set of integer points satisfying an 
explicitly given system of linear inequalities, given in one of the forms 

5 := {x e : Ax = 6} or S :^ {x E : Ax ^ b, I < x < u} , 

with matrix A e Z"^^", right-hand side b e Z"*, and lower and upper bounds l,u E Z^. 

As demonstrated in §1.1, if the polyhedron P :— {x E : Ax — b, I < x < u} 
is unbounded then the convex integer programming problem with an oracle presented 
convex fimctional is rather hopeless. Therefore, an algorithm that solves the convex 
integer programming problem should either return an optimal solution, or assert that the 
program is infeasible, or assert that the underlying polyhedron is unbounded. 

Nonetheless, we do allow the lower and upper bounds l,u to lie in Z^ rather than 
Z", since often the polyhedron is bounded even though the variables are not bounded 
explicitly (for instance, if each variable is bounded below only, and appears in some equa- 
tion all of whose coefficients are positive). This results in broader formulation flexibility. 
Furthermore, in the next subsections we prove auxiliary lemmas asserting that certain 
sets cover all edge-directions of relevant polyhedra, which do hold also in the unbounded 
case. So we now extend the notion of edge-directions, defined in §2.1 for polytopes, to 
polyhedra. A direction of an edge (1-dimensional face) e of a polyhedron P is any nonzero 
scalar multiple oiy — x where a;, y are any two distinct points in e. As before, a set covers 
all edge-directions of P if it contains a direction of each edge of P. 

5.1 Convex Integer Programming over Totally Unimodular Sys- 
tems 

A matrix A is totally unimodular if the determinant of every square submatrix of A lies in 
{ — 1, 0, 1}. Such matrices arise naturally in network flows, ordinary (2-way) transportation 
problems, and many other situations. A fundamental result in integer programming [37] 
asserts that polyhedra defined by totally unimodular matrices arc integer. More precisely, 
if >1 is an m X n totally unimodular matrix, /, m e Z^, and b e Z"*, then 

Pi := conv{x e Z" : Ax = 6, l<x<u} = {xeW : Ax^b, I < x < u} -.^ P , 



40 



that is, the underlying polyhedron P coincides with its integer hull Pj. This has two 
consequences useful in facilitating the solution of the corresponding convex integer pro- 
gramming problem via the algorithm of Theorem 2.4. First, the corresponding linear 
integer programming problem can be solved by linear programming over P in polynomial 
time. Second, a set covering all edge-directions of the implicitly given integer hull Pj, 
which is typically very hard to determine, is obtained here as a set covering all edge- 
directions of P which is explicitly given and hence easier to determine. 

We now describe a well known property of polyhedra of the above form. A circuit of 
a matrix A e Z'"^" is a nonzero primitive minimal support element of jC.{A). So a circuit 
is a nonzero c e Z" satisfying Ac — 0, whose entries are relatively prime integers, such 
that no nonzero c' with A(^ — has support strictly contained in the support of c. 

Lemma 5.1 For every A G Z™^", /,« e ZJ^, and b G Z™, the set of circuits of A covers 
all edge- directions of the polyhedron P := {a; G M" : Ax = b, I < x < u}. 

Proof. Consider any edge e of P. Pick two distinct points x.y E e and set g := y — x. 
Then Ag = and therefore, as can be easily proved by induction on |supp(5')|, there is 
a finite decomposition g = with ctj positive real number and q circuit of A such 

that aiCi C g for all i, where □ is the natural extension from Z" to of the partial order 
defined in §4.2. We claim that x + aiCi G P for all i. Indeed, q being a circuit implies 
A{x + ctjCi) = Ax = b; and I < x,x + g < u and QfjCj □ g imply I < x + aiCi < u. 

Now let w G be a linear functional uniquely maximized over P at the edge e. Then 
waiCi = w{x + aiCi) — wx < for all i. But ^(wajQ) = wg = wy — wx = 0, implying 
that in fact waiCi — and hence x + aiCi G e for all i. This implies that each q is a 
direction of e (in fact, all Cj are the same and g is a multiple of some circuit). □ 

Combining Theorem 2.4 and Lemma 5.1 we obtain the following statement. 

Theorem 5.2 For every fixed d there is a polynomial time algorithm that, given m x n 
totally unimodular matrix A, set C C Z" containing all circuits of A, vectors l,u & Z^, 
b G Z™', and wi, . . . ,Wd & Z", and convex c : R'^ — > M presented by a comparison oracle, 
encoded as [{A, C,l,u,b,wi, . . . , Wd)], solves the convex integer program 

max{c{wix, . . . , Wdx) : x G Z", Ax — b, I < x < u} . 

Proof. First, check in polynomial time using linear programming whether the objective 
function of any of the following 2n linear programs is unbounded, 

max {±yi : y G P}, i = 1, . . . , n, P := {y G : Ay ^ b, I <y <u} . 
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If any is unbounded then terminate, asserting that P is unbounded. Otherwise, let p be 
the least integer upper bound on the absohitc value of all optimal objective values. Then 
P C [— p, p]" and S := {y E : Ay = b, I < y < u} G P is finite of radius p{S) < p. In 
fact, since A is totally unimodular, Pj = P = conv(5') and hence p{S) = p. Moreover, by 
Cramer's rule, (p) is polynomially bounded in {A,l,u,x). 

Now, since A is totally unimodular, using linear programming over Pj — P we can 
simulate in polynomial time a linear discrete optimization oracle for S. By Lemma 
5.1, the given set C, which contains all circuits of A, also covers all edge-directions of 
conv(S') = Pi = P. Therefore we can apply the algorithm of Theorem 2.4 and solve the 
given convex n-fold integer programming problem in polynomial time. □ 

While the number of circuits of an m x n matrix A can be as large as 2(^"-^) and 
hence exponential in general, it is nonetheless relatively small in that it is bounded in 
terms of m and n only and is independent of the matrix A itself. Furthermore, it may 
happen that the number of circuits is much smaller than the upper bound 2(^"-^). Also, 
if in a class of matrices, m grows slowly in terms of n, say m — O(logn), then this bound 
is subexponential. In such situations, the above theorem may provide a good strategy for 
solving convex integer programming over totally unimodular systems. 



5.2 Graver Bases and Convex Integer Programming 

We now extend the statements of §5.1 about totally unimodular matrices to arbitrary 
integer matrices. The next lemma shows that the Graver basis of any integer matrix 
covers all edge-directions of the integer hulls of polyhedra defined by that matrix. 

Lemma 5.3 For every A E Z"^^", l,u E Z^, and b E TT- , the Graver basis Q{A) of A 
covers all edge- directions of the polyhedron Pi :— conv{x E : Ax — b, I < x <u}. 

Proof. Consider any edge e of Pi and pick two distinct points x,y E enli". Then 
g :— y — x ism C{A) \ {0}. Therefore, by Lemma 4.2, there is a conformal sum g — Y2i hi 
with hi E Q{A) for all i. We claim that x + hi E Pi for all i. Indeed, first note that 
hi E Q{A) C C{A) implies Ahi = and hence A{x + hi) = Ax — b; and second note that 
I < x,x + g < u and hi ^ g imply that I < x + hi < u. 

Now let w e Z" be a linear functional uniquely maximized over Pi at the edge e. Then 
whi — w{x + hi) — wx < for all i. But '^{whi) — wg — wy — wx — 0, implying that in 
fact whi — and hence x + hi E e for all i. Therefore each hi is a direction of e (in fact, 
all hi are the same and g is a multiple of some Graver basis element). □ 

Combining Theorems 2.4 and 4.4 and Lemma 5.3 we obtain the following statement. 
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Theorem 5.4 For every fixed d there is a polynomial time algorithm that, given in- 
teger m X n matrix A, its Graver basis G{A), l,u & Z^, a; G with I < x < u, 
Wi, . . . ,Wd G Z", and convex c : — > M presented by a comparison oracle, encoded as 
[{A, Q{A), I, u, X, wi, . . . , Wd)], solves the convex integer program with b := Ax, 

max{c('u;i2;, . . . jWdz) : z e Z", Az — b, I < z < u} . 

Proof. First, check in polynomial time using linear programming whether the objective 
function of any of the following 2n linear programs is unbounded, 

max{±yi : y e P}, i = 1,. . .,n, P := {y : Ay = b, I <y <u} . 

If any is unbounded then terminate, asserting that P is unbounded. Otherwise, let p be 
the least integer upper bound on the absolute value of all optimal objective values. Then 
P ^ [-p,p]" and S := {y e Z^' : Ay = b, I < y < u} C P is finite of radius p{S) < p. 
Moreover, by Cramer's rule, (p) is polynomially bounded in {A,l,u,x). 

Using the given Graver basis and applying the algorithm of Theorem 4.4 we can 
simulate in polynomial time a linear discrete optimization oracle for S. Furthermore, 
by Lemma 5.3, the given Graver basis covers all edge-directions of the integer hull 
Pj := conv{y G Z" : Ay = b,l < y < u} = conv(iS'). Therefore we can apply the al- 
gorithm of Theorem 2.4 and solve the given convex program in polynomial time. □ 



5.3 Convex N-fold Integer Programming in Polynomial Time 

We now extend the result of Theorem 4.11 and show that convex integer programming 
problems over n-fold systems can be solved in polynomial time as well. As explained in 
the beginning of this section, the algorithm either returns an optimal solution, or asserts 
that the program is infeasible, or asserts that the underlying polyhedron is unbounded. 

Theorem 5.5 For every fixed d and fixed {r + s) x t integer matrix A there is a polyno- 
mial time algorithm that, given n, lower and upper bounds l,u & Z^, wi, . . . G Z"*, 
b G U^'^^ , and convex functional c : R.^ — > M presented by a comparison oracle, encoded 
as [{I, u, Wi, . . . , Wd, b)], solves the convex n-fold integer programming problem 

max{c{wix, Wdx) : x G Z"*, ^^"^x = 6, I <x <u} . 

Proof. First, check in polynomial time using linear programming whether the objective 
function of any of the following 2nt linear programs is unbounded, 

max{±yi : y e P}, i ^ 1, . . . ,nt, P := {y G R"* : = 6, I <y <u} . 
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If any is unbounded then terminate, asserting that P is unbounded. Otherwise, let p be 
the least integer upper bound on the absolute vahic of all optimal objective values. Then 
P C [-p, p]"* and 5 := G Z"* : A^'^^y = b, I < y < u} C P is finite of radius p{S) < p. 
Moreover, by Cramer's rule, (p) is polynomially bounded in n and {l,u,b). 

Using the algorithm of Theorem 4.11 we can simulate in polynomial time a linear 
discrete optimization oracle for S. Also, using the algorithm of Theorem 4.7 we can 
compute in polynomial time the Graver basis Q{A^^'^) which, by Lemma 5.3, covers all 
edge-directions of Pj := conv{y G Z"* : A'-"^?/ = b,l < y < u} = conv(S'). Therefore 
we can apply the algorithm of Theorem 2.4 and solve the given convex n-fold integer 
programming problem in polynomial time. □ 

Again, as a special case of Theorem 5.5 we recover the following result of [17] concern- 
ing convex integer programming in standard form over n-fold matrices. 

Theorem 5.6 For every fixed d and fixed {r -\- s) x t integer matrix A there is a poly- 
nomial time algorithm that, given n, linear junctionals Wi, . . . ,Wd& Z"*, right-hand side 
b G Z''^"'*, and convex functional c : M.'^ — > M presented by a comparison oracle, encoded 
as [{wi, . . . , Wd, b)], solves the convex n-fold integer program in standard form 



5.4 Some Applications 

5.4.1 Transportation Problems and Packing Problems 

Theorems 5.5 and 5.6 generalize Theorems 4.11 and 4.12 by broadly extending the class 
of objective functions that can be maximized in polynomial time over n-fold systems. 
Therefore all apphcations discussed in §4.5 automatically extend accordingly. 

First, we have the following analog of Corollary 4.13 for the convex integer trans- 
portation problem over long 3-way tables. This has a very broad further generalization to 
multiway transportation problems over long fc-way tables of any dimension k, see §6. 

Corollciry 5.7 For every fixed d,p,q there is a polynomial time algorithm that, given 
n, arrays Wi, . . . ,Wd G Z^^^xn^ line-sums u G V^'^, v G Z**^" and z G Z'^^", and convex 
functional c : — > M presented by a comparison oracle, encoded as [{wi, . . . , w^, u, v, z)], 
solves the convex integer 3-way line-sum transportation problem 



meix{c{wix, wax) : x G N"*, A'^^'^'x = b} . 



max{ c{wix, . . . , Wdx) 



x G W""^ 




3 



k 
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Second, we have the following analog of Corollary 4.15 for convex bin packing. 

Corollciry 5.8 For every fixed d, number of types t, and type weights Vi, . . . ,Vt & there 
is a polynomial time algorithm that, given n bins, item numbers ni, . . . , G Z, bin capaci- 
ties Ml, ... , Un G Z, utility matrices Wi, . . . ,Wd E Z*^", and convex functional c : M'^ — > M 
presented by a comparison oracle, encoded as [{ni, . . . ,nt,ui, . . . , Un, Wi, . . . , Wd)], solves 
the convex integer bin packing problem, 

max{ c{wix, . . . , wax) : x e N*""" , ^ vjXj^k = , ^ xj^k = } ■ 

j k 

5.4.2 Vector Peirtitioning and Clustering 

The vector partition problem concerns the partitioning of n items among p players to 
maximize social value subject to constraints on the number of items each player can 
receive. More precisely, the data is as follows. With each item i is associated a vector 
Vi e Z^ representing its utility under k criteria. The utility of player h under ordered 
partition tt = (tti, . . . , tt^) of the set of items {1, . . . , n} is the sum := X^j^^^ Vi of 
utility vectors of items assigned to h under tt. The social value of tt is the balancing 
c(f . . . , v^ ,,, . . . , Vp i, . . . , Vp ,,) of the player utilities, where c is a convex functional on 
MP'^. In the constrained version, the partition must be of a given shape, i.e. the number 
Itt/jI of items that player h gets is required to be a given number (with ^ A/i = n). In 
the unconstrained version, there is no restriction on the number of items per player. 

Vector partition problems have applications in diverse areas such as load balancing, 
circuit layout, ranking, cluster analysis, inventory, and reliability, see e.g. [7, 9, 25, 39, 50] 
and the references therein. Here is a typical example. 

Example 5.9 (minimal variance clustering). This problem has numerous applica- 
tions in the analysis of statistical data: given n observed points Vi,...,Vn in /c-space, 
group them into p clusters tti, . . . , tt^ that minimize the sum of cluster variances given by 

E^Eii-<-(^E''.)ir 

h=l ' ieiTh ' ' ienh 

Consider instances where there are n — pm points and the desired clustering is balanced, 
that is, the clusters should have equal size m. Suitable manipulation of the sum of 
variances expression above shows that the problem is equivalent to a constrained vector 
partition problem, where Xh = m for all h, and where the convex functional c 
(to be maximized) is the Euclidean norm squared, given by 

p k 

c{z) = ll^ll^ = ^^M'^ ■ 

h=l i=l 
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If either the number of criteria k or the number of players p is variable, the partition 
problem is intractable since it instantly captures NP-hard problems [39]. When both k,p 
are fixed, both the constrained and unconstrained versions of the vector partition problem 
are polynomial time solvable [39, 50]. We now show that vector partition problems (either 
constrained or unconstrained) are in fact convex n-fold integer programming problems and 
therefore, as a consequence of Theorem 5.6, can be solved is polynomial time. 

Corollary 5.10 For every fixed number p of players and number k of criteria, there is a 
polynomial time algorithm that, given n, item vectors vi,...,Vn G Z''', Ai, . . . , Ap e 
and convex functional c : W'' — > R presented by a comparison oracle, encoded as 
[{vi, . . . ,Vn, Xi, . . . , Xp)], solves the constrained and unconstrained partitioning problems. 

Proof. There is an obvious one-to-one correspondence between partitions and matrices 
X e {0, with all column-sums equal to one, where partition tt corresponds to the 
matrix x with Xh,i — 1 ii i & tth and Xh,i — otherwise. Let d :— pk and define d matrices 
Wh,j G Z^^" by setting {wh,j)h,i '■= Vij for all /i = 1. . . . ,p, i = 1, . . . ,n and j — 1, . . . ,k, 
and setting all other entries to zero. Then for any partition vr and its corresponding matrix 
X we have v^j = Whjx for all /i = 1, . . . ,p and j = 1, . . . , k. Therefore, the unconstrained 
vector partition problem is the convex integer program 

max{ c{wi^ix, Wp^kx) : x e , ^ Xh,i = 1 } ■ 

h 

Suitably arranging the variables in a vector, this becomes a convex n-fold integer program 
with a (0 + 1) X p defining matrix A, where Ai is empty and A2 :=(!,..., 1). 

Similarly, the constrained vector partition problem is the convex integer program 

max{ c{wi^ix, Wp^kx) : x e , ^ Xh,i = 1 , XI ^'^'^ = } ■ 

h i 

This again is a convex n-fold integer program, now with a (p -|- 1) x p defining matrix A, 
where now Ai := Ip is the p x p identity matrix and A2 := (1, . . . , 1) as before. 

Using the algorithm of Theorem 5.6, this convex n-fold integer program, and hence 
the given vector partition problem, can be solved in polynomial time. □ 
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6 Multiway Transportation Problems and Privacy in 
Statistical Databases 



Transportation problems form a very important class of discrete optimization problems. 
The feasible points in a transportation problem are the multiway tables ("contingency 
tables" in statistics) such that the sums of entries over some of their lower dimensional 
sub-tables such as hues or planes ("margins" in statistics) are specified. Transportation 
problems and their corresponding transportation polytopes have been used and studied 
extensively in the operations research and mathematical programming literature, as well 
as in the statistics literature in the context of secure statistical data disclosure and man- 
agement by public agencies, see [4, 6, 11, 18, 19, 42, 43, 53, 60, 62] and references therein. 

In this section we completely settle the algorithmic complexity of treating multiway 
tables and discuss the applications to transportation problems and secure statistical data 
disclosure, as follows. After introducing some terminology in §6.1, we go on to describe, 
in §6.2, a universality result that shows that "short" 3-way r x c x 3 tables, with variable 
number r of rows and variable number c of columns but fixed small number 3 of layers 
(hence "short"), are universal in a very strong sense. In §6.3 we discuss the general 
multiway transportation problem. Using the results of §6.2 and the results on linear 
and convex n-fold integer programming from §4 and §5, we show that the transportation 
problem is intractable for short 3-way r x c x 3 tables but polynomial time treatable for 
"long" {k + l)-way mi x • • • x x n tables, with k and the sides mi, ... , m^ fixed (but 
arbitrary), and the number n of layers variable (hence "long"). In §6.4 we turn to discuss 
data privacy and security and consider the central problem of detecting entry uniqueness 
in tables with disclosed margins. We show that as a consequence of the results of §6.2 
and §6.3, and in analogy to the complexity of the transportation problem established in 
§6.3, the entry uniqueness problem is intractable for short 3-way r x c x 3 tables but 
polynomial time decidable for long {k + l)-way mi x • • • x m^ x n tables. 

6.1 Tables and Margins 

We start with some terminology on tables, margins and transportation polytopes. 
A k-way table is an mi x • • • x m^ array x — (xj^^..._jj,) of nonnegative integers. A k-way 
transportation polytope (or simply k-way polytope for brevity) is the set of all mi x • • • x m^ 
nonnegative arrays x = (a;jj,...,j^) such that the sums of the entries over some of their lower 
dimensional sub-arrays (margins) are specified. More precisely, for any tuple {ii, . . . 
with ij G {1, . . . ,mj} U {+}, the corresponding margin Xi^^,,,^i^_ is the sum of entries 
of X over all coordinates j with ij — -|-. The support of (ii,...,ifc) sjciA. of is 
the set supp(ii, . . . , ifc) := {j : ij ^ +} of non-summed coordinates. For instance, if 
a;isa4x5x3x2 array then it has 12 margins with support F — {1,3} such as 
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2^3,+,2,+ = Y1^2=i X]i4=i ^3,«2,2,j4- A collection of margins is hierarchical if, for some family 
JF of subsets of {!,..., k}, it consists of all margins Ui^^^^^^i^ with support in JF. In par- 
ticular, for any < h < k, the collection of all h-margins of /c-tables is the hierarchical 
collection with the family of all /i-subsets of {1, . . . , /c}. Given a hierarchical collection 
of margins supported on a family T of subsets of {1, . . . , k}, the corresponding 

k-way polytope is the set of nonnegative arrays with these margins, 

TjF := {a; e M+ : a;^^,...,^^ = ^.^^ , supp(ii, . . . , 4) G JF j . 
The integer points in this polj^ope are precisely the k-way tables with the given margins. 

6.2 The Universality Theorem 

We now describe the following universality result of [13, 15] which shows that, quite 
remarkably, any rational polytope is a short 3-way r x c x 3 polytope with all line-sums 
specified. (In the terminology of §6.1 this is the r x c x 3 polytope Tjr of all 2-margins 
fixed, supported on the family = {{1, 2}, {1, 3}, {2, 3}}.) By saying that a polytope 
P C M*' is representable as a polytope Q C we mean in the strong sense that there is 
an injection a : {1, . . . ,p} — > {1, . . . ,q} such that the coordinate-erasing projection 

TT : W > W : X = {Xi, . . . ,Xq) 7r{x) = {Xa{l), . . . , Xa(p)) 

provides a bijection between Q and P and between the sets of integer points Q n and 
P n Z*'. In particular, if P is representable as Q then P and Q are isomorphic in any 
reasonable sense: they are linearly equivalent and hence all linear programming related 
problems over the two are polynomial time equivalent; they are combinatorially equivalent 
and hence they have the same face numbers and facial structure; and they are integer 
equivalent and therefore all integer programming and integer counting related problems 
over the two are polynomial time equivalent as well. 

We provide only an outline of the proof of the following statement; complete details 
and more consequences of this theorem can be found in [13, 15]. 

Theorem 6.1 There is a polynomial time algorithm that, given A G Z™^" and b G Z™, 
encoded as [{A, b)], produces r, c and line-sums u G Z^^'^, v G Z**^^ and z G Z'^^^ such that 
the polytope P := {y E : Ay = b} is representable as the 3-way polytope 

T :— {x G R_|. : ^ ^ ^i,j,k — Zj,k ■, ^ ^ ^i,j,k — '^i,k i ^ ^ ^i,j,k — } ■ 

i 3 k 

Proof. The construction proving the theorem consists of three polynomial time steps, 
each representing a polytope of a given format as a polytope of another given format. 
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First, we show that any P .— {y > : Ay = b} with A, b integer can be represented 
in polynomial time as Q := {a; > : Cx = d} with C matrix all entries of which are 
in { — 1,0, 1,2}. This reduction of coefficients will enable the rest of the steps to run in 
polynomial time. For each variable yj let kj := max{|_log2 : i = 1, . . .m} be the 

maximum number of bits in the binary representation of the absolute value of any entry Uij 
of A. Introduce variables Xj^, . . . , Xj^kj, and relate them by the equations j — j+i = 0. 
The representing injection a is defined by a{j) := (j, 0), embedding Consider 
any term aij yj of the original system. Using the binary expansion \aij\ = Yl's=o ^s'^^ with 
all ts G {0,1}, we rewrite this term as ± X]s=o hard to verify that this 

represents P as Q with defining { — 1, 0, 1, 2}-matrix. 

Second, we show that any Q '■= {y > : Ay = b} with A, b integer can be represented 
as a face F of a 3-way polytope with all plane-sums fixed, that is, a face of a 3-way 
polytope Tjr of all 1-margins fixed, supported on the family — {{1}, {2}, {3}}. 

Since Q is a polytope and hence bounded, we can compute (using Cramer's rule) an 
integer upper bound U on the value of any coordinate yj of any y & Q. Note also that a 
face of a 3-way polytope TV is the set of all x = {xij^k) with some entries forced to zero; 
these entries are termed "forbidden" , and the other entries are termed "enabled" . 

For each variable yj, let rj be the largest between the sum of positive coefficients of 
yj and the sum of absolute values of negative coefficients of yj over all equations, 

Tj := max ( ^{a^j : a^j > 0} , J^{|afej| : a^j < 0} 

\ k k 

Assume that A is of size m x n. Let r := ^^^i^j, R := {1, . . . ,r}, h := m + 1 and 
H :— {1, . . . , h}. We now describe how to construct vectors u,v E 7/',z E Z'*, and a set 
EcRxRxHoi triples - the enabled, non-forbidden, entries - such that the polytope 
Q is represented as the face F of the corresponding 3-way polytope oi r x r x h arrays 
with plane-sums u, v, z and only entries indexed by E enabled, 

F := {xeR^^^'* : = for all A;) ^ F , and 

^ ] -^iJife ~ ) ^ ] ^i,j,k — Vj , ^ ] ^i,j,k — '^i} ■ 
i,j i^k jyk 

We also indicate the injection a : {1, . . . , n} — RxRxH giving the desired embedding 

of coordinates yj as coordinates .t,; ^ and the representation of Q as F. 

Roughly, each equation = 1, . . . ,m is encoded in a "horizontal plane" R x R x {k} 
(the last plane R x R x {h} is included for consistency with its entries being "slacks"); 
and each variable yj, j = 1, . . . ,n is encoded in a "vertical box" Rj x Rj x H, where 
R — l+J"=i Rj is the natural partition of it! with \Rj\ — Vj for all j = 1, ... , n, that is, with 

Rj — {1 + Ei<j n, ■ ■ ■ , Y.i<j n}- 
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Now, all "vertical" plane-sums are set to the same value U, that is, Uj := Vj :— U 
for j = 1, . . . ,r. All entries not in the union l^^^iRj x Rj x H of the variable boxes 
will be forbidden. We now describe the enabled entries in the boxes; for simplicity we 
discuss the box Ri x Ri x H, the others being similar. We distinguish between the two 
cases ri — 1 and ri > 2. In the first case, Ri — {!}; the box, which is just the single 
line {1} X {1} X H, will have exactly two enabled entries (1, 1, A;"*"), (1, 1, k~) for suitable 
k^, k~ to be defined later. We set a{l) := (1, 1, A;"*"), namely embed yi = 2:11^+. We define 
the complement of the variable yi to be yi := U — yi (and likewise for the other variables). 
The vertical sums m, v then force yi = U — yi = U — Xi^i^k+ = ^i,i,k-, so the complement 
of yi is also embedded. Next, consider the case ri > 2. For each s — 1, . . . ,ri, the hne 
{s} X {s} X H (respectively, {s} x {1 + (s mod ri)} x H) will contain one enabled entry 
(s, s, /c+(s)) (respectively, (s, 1 + (s mod ri),k~{s)). All other entries of Ri x Ri x H 
will be forbidden. Again, we set a{l) := (1, 1, A;"''(l)), namely embed yi = a;i^i^fc+(i); it is 
then not hard to see that, again, the vertical sums u, v force Xs,s,k+{s) = Xi,i,k+W ~ ^1 ^^"^ 
Xs,i+(s mod ri),fe-(s) - = Tji for each s = 1, . . . , ri. Therefore, both yi and yi 

are each embedded in ri distinct entries. 

We now encode the equations by defining the horizontal plane-sums z and the indices 
k~^{s),k~{s) above as follows. For k = l,...,m, consider the k-th equation 
Tl/j'^kjUj = bk- Define the index sets J"*" := {j : ak,j > 0} and := {j : Gkj < 0}, and 
set Zk := bk + U ■ X^j^j- \(ik,j\- The last coordinate of z is set for consistency with u, v to 
be Zh — Zm+i :— r • U — YL^=i ^k- Now, with yj :— U — yj the complement of variable yj 
as above, the A;-th equation can be rewritten as 

n 

jeJ+ jeJ- j=i jeJ- jeJ- 

To encode this equation, we simply "pull down" to the corresponding k-th horizontal plane 
as many copies of each variable yj or yj by suitably setting k~^{s) := k or k^{s) := k. By 
the choice of rj there are sufficiently many, possibly with a few redundant copies which 
are absorbed in the last hyperplane by setting /c+(s) :— m -\- 1 or k~{s) :— m -\- 1. This 
completes the encoding and provides the desired representation. 

Third, we show that any 3-way polytope with plane-sums fixed and entry bounds, 

F:={ye R^^^x" : ^ y^^j^^ = Ck,Y = ^3^Y = ' ^^-^'^ - ^^-^^^ ' 

i^j i,k j^k 

can be represented as a 3-way polytope with line-sums fixed (and no entry bounds), 
T := {x e W^"""^ : Y ^i,J,k = zj,k , Y ^^"^'^ ^ ^^'^ ' Y ^^'-^'^ ^ ^^"^ ^ ' 

I J K 
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In particular, this imphes that any face F of a 3-way polytope with plane-sums fixed can 
be represented as a 3-way polytope T with line-sums fixed: forbidden entries are encoded 
by setting a "forbidding" upper-bound Cij^k '■= on all forbidden entries k) ^ E and 
an "enabling" upper-bound Cij^k '■= U on all enabled entries k) e E. We describe the 
presentation, but omit the proof that it is indeed valid; further details on this step can 
be found in [12, 13, 15]. We give explicit formulas for it/ j, vj^Ki zj,k in terms of a^, bj, Ck 
and Cj jfc as follows. Put r := I ■ m and c :— n -\- 1 -\- m. The first index I of each entry 
xi,j,K will be a pair / = in the r-set 



The last index K will simply range in the 3-set {1,2,3}. We represent F as T via 
the injection a given explicitly by k) := (1, k), 1), embedding each variable 

yi,j,k as the entry a;(jj)^(i^fc)^i. Let U now denote the minimal between the two values 
maxjoi, . . . ,ai} and max{6i, . . . , 6^}. The line-sums (2-margins) are set to be 



Applying the first step to the given rational polytope P, applying the second step to 
the resulting Q, and applying the third step to the resulting F, we get in polynomial time 
a 3-way r x c x 3 polytope T of all line-sums fixed representing P as claimed. □ 



6.3 The Complexity of the Multiway Transportation Problem 

We are now finally in position to settle the complexity of the general multiway transporta- 
tion problem. The data for the problem consists of: positive integers k (table dimension) 
and mi, ... , rrik (table sides); family T of subsets of {1, . . . ,k} (supporting the hierar- 
chical collection of margins to be fixed); integer values Ui^^,„^i^ for all margins supported 



{(1, 1), . . . , (1, m), (2, 1), . . . , (2, m), . . . , (Z, 1), . . . , (Z, m)} . 



The second index J of each entry .t/ jj^ will be a pair J = (s, t) in the c-set 



{(1, 1), . . . , (1, n), (2, 1), . . . , (2, 0, (3, 1), ... , (3, m)} . 
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on T\ and integer "profit" mi x ■ ■ ■ x array w. The transportation problem is to 

find an mi x ■ ■ ■ x m^ tabic having the given margins and attaining maximum profit, or 
assert than none exists. Equivalently, it is the linear integer programming problem of 
maximizing the linear functional defined by w over the transportation polytope Tjp, 

maxjwx : ^ e N'"i'' : Xi^^,„,i^ = , supp(ii, . . . , ife) GJT} . 

The following result of [12] is an immediate consequence of Theorem 6.1. It asserts 
that if two sides of the table are variable part of the input then the transportation problem 
is intractable already for short 3-way tables with T = {{1, 2}, {1, 3}, {2, 3}} supporting 
all 2- margins (line-sums). This result can be easily extended to /c-way tables of any 
dimension k > 3 and JF the collection of all /i-subsets of {1, . . . ,k} for any 1 < h < k as 
long as two sides of the table are variable; we omit the proof of this extended result. 

Corollary 6.2 It is NP-complete to decide, given r, c, and line-sums u £ 1/'^'^, 
V e Z^^^, and z e Z'^^^, encoded as [{u, v, z)], if the following set of tables is nonempty, 

i j k 

Proof. The integer programming feasibility problem is to decide, given A G Z™^" and 
b e Z™, if {y e N'^ : Ay = b} is nonempty. Given such A and 6, the polynomial time 
algorithm of Theorem 6.1 produces r, c and u e Z''^'^, v e Z''^^, and z e Z"^^^, such that 
{y e : Ay — b} is nonempty if and only if the set S above is nonempty. This reduces 
integer programming feasibility to short 3-way line-sum transportation feasibility. Since 
the former is NP-complete (see e.g. [55]), so turns out to be the latter. □ 

We now show that in contrast, when all sides but one are fixed (but arbitrary), and 
one side n is variable, then the corresponding long A;-way transportation problem for any 
hierarchical collection of margins is an n-fold integer programming problem and there- 
fore, as a consequence of Theorem 4.12, can be solved is polynomial time. This extends 
Corollary 4.13 established in §4.5.1 for 3-way line-sum transportation. 

Corollary 6.3 For every fixed k, table sides mi, . . . ,mk, and family T of subsets of 
{l,...,k + 1}, there is a polynomial time algorithm that, given n, integer values 
u = (Mii,...,ij._,_J for all margins supported on T , and integer mi x • • ■ x m^ x n array 
w, encoded as [{u,w)], solves the linear integer multiway transportation problem 

maxjwx : 2; e N™i^-^'"'=^", x^^^i,^^ = iin,...,i,+i, supp(ii, . . . , ife+i) e J^} . 
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Proof. Re-index the arrays as x — {x^,...,x"-) with each x^ = (a;ji,...,jj,j) a suitably 
indexed mim2 ■ ■ ■ rrik vector representing the j-th layer of x. Then the transportation 
problem can be encoded as an n-fold integer programming problem in standard form, 

max{wx : x e N"*, A^")^ = b} , 

with an (r + s) X t defining matrix A where t := m\m2 ■ ■ ■ rrik and r, s, Ai and A2 are 
determined from J^, and with right-hand side b :— (6°, 6^, . . . , 6") e 'p+ns determined 
from the margins u — (tiii,...,ij._^J, in such a way that the equations ■AiiYl]=i^'') ~ ^° 
represent the constraints of all margins a;jj_ .._jj._+ (where summation over layers occurs), 
whereas the equations A2X^ = V for j = 1, . . . , n represent the constraints of all margins 
Xii,...,if,,j with j ^ + (where summations are within a single layer at a time). 

Using the algorithm of Theorem 4.12, this n-fold integer program, and hence the given 
multiway transportation problem, can be solved in polynomial time. □ 

The proof of Corollary 6.3 shows that the set of feasible points of any long fc-way 
transportation problem, with all sides but one fixed and one side n variable, for any 
hierarchical collection of margins, is an n-fold integer programming problem. Therefore, 
as a consequence of Theorem 5.6, we also have the following extension of Corollary 6.3 
for the convex integer multiway transportation problem over long k-way tables. 

Corollary 6.4 For every fixed d, k, table sides mi, . . . ,mk, and family T of subsets 
of {1, . . . , k + 1}, there is a polynomial time algorithm that, given n, integer values 
u — (iiii,...,i;i._,_J for all margins supported on T , integer m\ x • • • x x n arrays 
wi, . . . ,Wd, and convex functional c : M.^ — > M presented by a comparison oracle, en- 
coded as [{u, wi, . . . , Wd)], solves the convex integer multiway transportation problem 

max{c{wix,...,wdx) : a; e M^^ix-xm^xn ^ 

Xh,...,ik+i = Uii,-,ik+i ' supp(ii, . . . ,ife+i) e J^} . 

6.4 Privacy and Entry-Uniqueness 

A common practice in the disclosure of a multiway table containing sensitive data is to 
release some of the table margins rather than the table itself, see e.g. [11, 18, 19] and 
the references therein. Once the margins are released, the security of any specific entry of 
the table is related to the set of possible values that can occur in that entry in any table 
having the same margins as those of the source table in the data base. In particular, if this 
set consists of a unique value, that of the source table, then this entry can be exposed and 
privacy can be violated. This raises the following fundamental entry-uniqueness problem: 
given a consistent disclosed (hierarchical) collection of margin values, and a specific entry 
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index, is the value that can occur in that entry in any table having these margins unique? 
We now describe the results of [48] that settle the complexity of this problem, and interpret 
the consequences for secure statistical data disclosure. 

First, we show that if two sides of the table are variable part of the input then the 
entry-uniqueness problem is intractable already for short 3- way tables with all 2-margins 
(line-sums) disclosed (corresponding to — {{1, 2}, {1, 3}, {2, 3}}). This can be easily 
extended to k-waj tables of any dimension A; > 3 and the collection of all /i-subsets 
of {1, . . . ,k} for any 1 < h < k as long as two sides of the table are variable; we omit 
the proof of this extended result. While this result indicates that the disclosing agency 
may not be able to check for uniqueness, in this situation, some consolation is in that an 
adversary will be computationally unable to identify and retrieve a unique entry either. 

Corollary 6.5 It is coNP- complete to decide, given r, c, and line-sums u e Z^^*^, 
V e Z''^^, z e Z'^^^, encoded as [{u,v,z)], if the entry Xi i^i is the same in all tables in 

{x G N : ^ ^ ^i,j,k — Zj,k 1 ^ ] ^i,j,k — t^i,k i ^ ^ ^i,j,k — } ■ 

i j k 

Proof. The subset-sum problem., well known to be NP-complete, is the following: given 
positive integers ag, ai, . . . , a„i, decide if there is an / C {1, . . . , m} with Oq = J^ie/ 
reduce the complement of subset-sum to entry-uniqueness. Given oq, ai, . . . , am, consider 
the polytope in 2(m + 1) variables yo,yi. . ., ym, zo, zi, . . . , Zm, 

m 

P := {{y, z) e M+^"'"^^^ : ao|/o - 0,7/^ = 0, yi + Zi = 1 , i = 0,1 . . . ,m} . 

1=1 

First, note that it always has one integer point with yo = 0, given by |/j = and Zi = 1 
for all i. Second, note that it has an integer point with yo 7^ if and only if there is 
an / C {1, . . . ,m} with oq = J2iei^i^ by yo = 1, = 1 for i e I, yi ^ for 

i e {1, . . . , m} \ I, and Zi = 1 — yi for all i. Lifting P to a suitable r x c x 3 line-sum 
polytope T with the coordinate yo embedded in the entry Xi^i^i using Theorem 6.1, we 
find that T has a table with Xi^i^i = 0, and this value is unique among the tables in T if 
and only if there is no solution to the subset-sum problem with ao,ai, . . . ,am- □ 

Next we show that, in contrast, when all table sides but one are fixed (but arbitrary), 
and one side n is variable, then, as a consequence of Corollary 6.3, the corresponding 
long fc-way entry-uniqueness problem for any hierarchical collection of margins can be 
solved is polynomial time. In this situation, the algorithm of Corollary 6.6 below allows 
disclosing agencies to efficiently check possible collections of margins before disclosure: if 
an entry value is not unique then disclosure may be assumed secure, whereas if the value 
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is unique then disclosure may be risky and fewer margins should be released. Note that 
this situation, of long multiway tables, where one category is significantly richer than 
the others, that is, when each sample point can take many values in one category and 
only few values in the other categories, occurs often in practical applications, e.g., when 
one category is the individuals age and the other categories are binary ("yes-no"). In 
such situations, our polynomial time algorithm below allows disclosing agencies to check 
entry-uniqueness and make learned decisions on secure disclosure. 

Corollary 6.6 For every fixed k, table sides mi, . . . ,mk, and family T of subsets of 
{!,..., A; -|- 1}, there is a polynomial time algorithm that, given n, integer values 
— (""iiv-jfe+i) /^'^ margins supported on T, and entry index (ii, . . . , i^+i), 
encoded as [n, decides if the entry Xi^^,,,^i^^^ is the same in all tables in the set 

^^ra,.-.m,.n . ^^^^^^^^^ ^ SUpp(ii, . . . , J.+i) ^ ^ } ■ 

Proof. By Theorem 6.3 we can solve in polynomial time both transportation problems 

I := minja;,,,...,,,^, : ^ g , x ^Tr) , 

u := max{xi,,...,,,^, : xeN'"l"•••^'"'="^ x e T^} , 
over the corresponding /c-way transportation polytope 

Tr:={x^ Krn,x...x^,xn . ^^^^^^^^^ ^ ^^^^^^^^^ ^ ^^pp(_^.^^ ^^^^^^ ^ ^ I _ 

Clearly, entry ,Xjj has the same value in all tables with the given (disclosed) margins 

if and only if / = ti, completing the description of the algorithm and the proof. □ 
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